Forum Moderators: Robert Charlton & goodroi
Likewise, I have two identical pages, one has URL with lang=en parameter and one without (neither blocked by robots.txt).
I have put canonical tag on pages with lang=en to point to the URL without lang=en parameter, however WMT still reports these two pages as having duplicate titles / descriptions.
I would imagine that
a) if a page is blocked in robots.txt, it should not report duplicate title / description in WMT?
b) if a page has canonical tag implemented, it should not come as duplicate title/description with the page the canonical tag points to?
I have verified that the pages ARE blocked in robots.txt and they are also listed under "pages blocked by robots.txt in WMT.
Or is my understanding wrong?
* Was that url always in robots.txt from the time it first went live, or did you add it to robots.txt a bit later?
* Have you validated the syntax of your robots.txt file?
But if the url is no longer allowed, then whatever kind of loose information is being reported in the WMT pages, there should be nop effect on the ranking of your allowed pages. Yes, you're right about the canoncial tag - it's clearly no worry even if the WNT reports duplicates across several urls with the same canonical tag.
[edited by: tedster at 9:55 pm (utc) on April 26, 2009]
I guess all I can do now is wait and see if duplicate title / descriptions disappear from content analysis after some time has passed.
BTW, I have verified robots.txt (I used a tool in WMT and it confirmed the URL is excluded via robots.txt) and even more, the page itself appears as an entry under "Pages disallowed by robots.txt" in the WMT overview, so obviously, Google has picked up the info that this page should not have been crawled!