homepage Welcome to WebmasterWorld Guest from 107.22.45.61
register, free tools, login, search, subscribe, help, library, announcements, recent posts, open posts,
Accredited PayPal World Seller

Home / Forums Index / Google / Google SEO News and Discussion
Forum Library, Charter, Moderators: Robert Charlton & aakk9999 & brotherhood of lan & goodroi

Google SEO News and Discussion Forum

This 137 message thread spans 5 pages: 137 ( [1] 2 3 4 5 > >     
Search Engines Agree on "Canonical tag"
youfoundjake




msg:3848490
 3:44 am on Feb 13, 2009 (gmt 0)

Not sure where to put this, but since google search is moderated, a mod will put where necessary.

Today it was annouced that the 3 big search engines have come up with a new tag to help with canoncial issues.
Announcements:
[googlewebmastercentral.blogspot.com ]
[ysearchblog.com ]
[blogs.msdn.com ]

Using the new canonical tag

Specify the canonical version using a tag in the head section of the page as follows:

<link rel="canonical" href="http://www.example.com/product.php?item=swedish-fish"/>
That’s it!

You can only use the tag on pages within a single site (subdomains and subfolders are fine).
You can use relative or absolute links, but the search engines recommend absolute links.
This tag will operate in a similar way to a 301 redirect for all URLs that display the page with this tag.

Links to all URLs will be consolidated to the one specified as canonical.
Search engines will consider this URL a “strong hint” as to the one to crawl and index.

< See also Canonical Tag Results: Share the stories - Positive / Negative / No Impact [webmasterworld.com] >

[edited by: tedster at 6:08 pm (utc) on April 2, 2009]

 

caribguy




msg:3848537
 7:17 am on Feb 13, 2009 (gmt 0)

Maybe I'm missing something, but wouldn't it have been simpler to accomplish this by using <base href="http://www.example.com" />

garlicjr




msg:3848540
 7:32 am on Feb 13, 2009 (gmt 0)

Collectively, how many millions of hours have been spent by developers trying to get around this problem? This is a trivial solution that they could have put in place a long time ago.

Note to self - I wish someone would write a book about how much economic damage is done by dumb ass programming and lame software.

tedster




msg:3848555
 7:48 am on Feb 13, 2009 (gmt 0)

This solution addresses all kinds of canonical problems [webmasterworld.com], not just with-www and no-www. It even lets the search engines know not to index certain parameters might be added to the url - session ids and so on.

From the Yahoo article:
<link rel=”canonical” href=”http://www.example.com/products” />

The above tag indicates to the crawler that the URL it is present on should be represented canonically as http://www.example.com/products. This would eliminate the following duplicates:

http://www.example.com/products?trackingid=feed
http://www.example.com/products?sessionid=hgjkeor2
http://www.example.com/products?printable=yes&trackingid=footer


caribguy




msg:3848566
 7:58 am on Feb 13, 2009 (gmt 0)

Thanks for setting me straight Ted, I should read before posting :)

dhaliwal




msg:3848590
 8:22 am on Feb 13, 2009 (gmt 0)

good to see some action

Robert Charlton




msg:3848595
 8:29 am on Feb 13, 2009 (gmt 0)

Regarding the <base> link, the Google Webmaster Central Blog article discusses how the <base> link can be used together with the rel="canonical" if you choose to use relative rather than absolute paths....

...relative paths are recognized as expected with the <link> tag. Also, if you include a <base> link in your document, relative paths will resolve according to the base URL.

That said, absolute paths are recommended for the canonical tag.

AjiNIMC




msg:3848602
 8:33 am on Feb 13, 2009 (gmt 0)

I think it will address the https issue as well, which we had discussed a lot.

Good to see some action from GYM.

smallcompany




msg:3848611
 8:45 am on Feb 13, 2009 (gmt 0)

So, this is not "will" but can be used as of now...

steveb




msg:3848612
 8:45 am on Feb 13, 2009 (gmt 0)

If this is okay

<link rel="canonical" href="http://www.example.com/page.html" />

is this okay?

<link rel="canonical" href="http://www.example.com/page.html">

Removing the xml trailing slash?

caribguy




msg:3848630
 9:10 am on Feb 13, 2009 (gmt 0)


Not just okay, the slash is forbidden...

[w3.org...]

steveb




msg:3848638
 9:16 am on Feb 13, 2009 (gmt 0)

I got a invalid warning when I tried to validate the version with a slash. the version without a slash validated.

koan




msg:3848674
 10:21 am on Feb 13, 2009 (gmt 0)

I got a invalid warning when I tried to validate the version with a slash. the version without a slash validated.

Doesn't it depend on your Doctype? no slashie for HTML 4.0, slashie for XHTML 1.0.

m0thman




msg:3848690
 10:43 am on Feb 13, 2009 (gmt 0)

Interesting, can I ask what may be a silly question though? I've often considered developing a more mobile friendly version of my site but been worried about duplicate content issues (among other things). Would this be an ideal use for the new canonical tag?

woldie




msg:3848720
 11:43 am on Feb 13, 2009 (gmt 0)

An important development from the search engines. I was wondering will this eliminate/fix the duplicate title and meta tags data from Google Webmaster Tools?

johnser




msg:3848724
 11:55 am on Feb 13, 2009 (gmt 0)

Am I right in thinking therefore that a link such as
www.domain.com/landing.php?aff=123

will now be viewed by Google as
www.domain.com

(assuming the tag is used of course)
If so, this will have major positive implications for merchants....

Receptional Andy




msg:3848725
 11:56 am on Feb 13, 2009 (gmt 0)

Bah. Another proprietary element to fix an underlying technical problem.

That said, it's in the toolbox. I'm curious as to whether this would work for pagination, but the FAQ implies otherwise:

Is it okay if the canonical is not an exact duplicate of the content?

We allow slight differences, e.g., in the sort order of a table of products. We also recognize that we may crawl the canonical and the duplicate pages at different points in time, so we may occasionally see different versions of your content. All of that is okay with us.

Page 2 is obviously going to be substantially different from page one, but this element would be ideal for that scenario.

httpwebwitch




msg:3848745
 12:44 pm on Feb 13, 2009 (gmt 0)

>>Another proprietary element to fix an underlying technical problem

Well said - it's like robots.txt, nofollow, sitemap.xml, etc. The search engines own the Internet now, so why not redecorate to suit their taste.

I get what this means, but I wonder what it means, right? I guess it's a lazy way to imply a 301 redirection without actually redirecting, for content publishers who don't have access to modify the HTTP headers. I'm going to spend the rest of the week imagining ways this could be abused.

wildbest




msg:3848746
 12:45 pm on Feb 13, 2009 (gmt 0)

OMG, life can be so simple! :)

skipfactor




msg:3848759
 1:27 pm on Feb 13, 2009 (gmt 0)

A handy tool for webmasters without the means to rewrite. Might free-up some of jdMorgan's time too. ;)

Asia_Expat




msg:3848766
 1:47 pm on Feb 13, 2009 (gmt 0)

Right... and this means I can drop a part of my htaccess that might otherwise have used up resources, or complicated other htaccess rules.

[edited by: Asia_Expat at 1:48 pm (utc) on Feb. 13, 2009]

Miamacs




msg:3848771
 2:08 pm on Feb 13, 2009 (gmt 0)

Is rel="canonical" a hint or a directive?
It's a hint that we honor strongly. We'll take your preference into account, in conjunction with other signals, when calculating the most relevant page to display in search results.

...

... honor it... take your preference into account...

Bentler




msg:3848818
 3:14 pm on Feb 13, 2009 (gmt 0)

This will be very useful all around. Thanks for passing it along. The full url is important for sites that allow a page to be requested with multiple urls, which some do-- and it would be nice to add a canonical in these cases.

I wonder if this would be useful for tagging a print version to reference its screen version for indexing too.

johnnie




msg:3848827
 3:38 pm on Feb 13, 2009 (gmt 0)

This is not a solution

There, I got your attention. Using this tag is merely fighting symptoms, instead of properly addressing the underlying problem. The problem in this case being poor site structure and/or server configuration. I'm already appalled by the large lazy bunch that will see this tag as an excuse to ignore the task of properly structuring their content and correctly configuring their server. The canonical tag should, in my opinion, be viewed as a supplementary tool to prevent any unwanted duplication in cases like campaign tagging.

Poor structure is not only confusing to search engines, but also to users (and linkerati)

[edited by: johnnie at 3:42 pm (utc) on Feb. 13, 2009]

supafresh




msg:3848832
 3:40 pm on Feb 13, 2009 (gmt 0)

So if I put this tag on every page of my site because of our extensive partner network is a source of major duplicate content. Would the 3 search engines see this as abuse?

Will Google handle this the same way it does with other blocked pages and sometimes add the physical URL in the SERPS?

What is the difference between this and the robots="NOINDEX " tag.

How is the link juice distributed via the canonical tag? Would these links be devalued or not counted at all because of the canonical tag?

I'm very hesitant to release the hounds at the moment.

tonynoriega




msg:3848839
 3:57 pm on Feb 13, 2009 (gmt 0)

Ok, maybe someone can help me with my issue. (keep in mind i starte here only 5 months ago... so the site framework was in place pre-me)

site is built around asp and .net

alot of pages within the site are linked as "/goodtimes/bettertimes.asp?id="

relative links.

we use a CMS so there are several templates that users can grab to create a page, and they all have the: <base target="_top"></base> which is for frames right?

can i use the base tag to implement this new tag?

essentially, i am finding that in my Google Webmaster Tools they see:

index.asp?id=
index.asp

as two separate pages on various pages throughout my site...which i would like to avoid.

how should i implement this new tag? or should i at all?

Asia_Expat




msg:3848844
 4:08 pm on Feb 13, 2009 (gmt 0)

The one thing I can't do with robots.txt wildcards, is prevent blank query strings WITHOUT affecting my forum software. The only way would be to list every other directory as a 'disallow' in the robots file, or with a potentially resource intensive htaccess rule.

I've not looked at the details yet, but I think this is a well thought out addition to our toolkit (ignoring underlying technological problems of course).

[edited by: Asia_Expat at 4:12 pm (utc) on Feb. 13, 2009]

Tonearm




msg:3848852
 4:20 pm on Feb 13, 2009 (gmt 0)

Do we all agree that fixing canonical issues "for real" is better than using this tag? I think this is bad news for anyone who has fixed all canonical issues on their site. More SE exposure for the lazy competition.

pageoneresults




msg:3848862
 4:30 pm on Feb 13, 2009 (gmt 0)

Do we all agree that fixing canonical issues "for real" is better than using this tag?

You've got my vote and I say we put the onus on those providing the hosting services. This is something that should be part of a basic package these days but yet many hosts are freakin clueless, especially Windows hosting providers. Bunch of lazy cheap individuals who don't want to take the little bit of time that is involved to protect their network of hosting clients. Dingbats!

Okay, so this covers the BIG 3, what about 4, 5, 6, 7, 8, 9 and 10? Did we forget about those? What about any other crawler/bot that is indexing? How come Ask wasn't included in this? They are in a close race with that #3 position. ;)

I sit here and try to imagine what happens in the overall scheme of things. The BIG 3 adhere to these standards. The others could care less. Now you've got all these other indexing entities doing who knows what. How does that effect the BIG 3's view of the target domains?

Ya, a band-aid for sure but it does help quite a few with the BIG 3 only. One of these days Webmasters will realize the importance of addressing issues such as this at the root of the cause. Not 2, 3, or 4 steps into the process. I have a feeling that this leaves the door open for a bit of confusion when looking at the global aspects of it.

More like Canonical Tag Soup. ;)

wheel




msg:3848878
 4:54 pm on Feb 13, 2009 (gmt 0)

Do we all agree that fixing canonical issues "for real" is better than using this tag? I think this is bad news for anyone who has fixed all canonical issues on their site. More SE exposure for the lazy competition.

+1 to that.

Fix the issues yourself. That helps you first and foremost without leaving any footprints.

I refuse to do anything specifically for the SE's benefit. I don't dance to their tune, they dance to mine. And specialty tags or attributes that are search engine defined falls into the camp of dancing to their tune.

Make it part of something like the W3 standard and maybe I'd think about it. But why would standards' bodies do something specifically for three commercial companies? They won't of course.

This 137 message thread spans 5 pages: 137 ( [1] 2 3 4 5 > >
Global Options:
 top home search open messages active posts  
 

Home / Forums Index / Google / Google SEO News and Discussion
rss feed

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
Terms of Service ¦ Privacy Policy ¦ Report Problem ¦ About
© Webmaster World 1996-2014 all rights reserved