Forum Moderators: goodroi

Message Too Old, No Replies

Yahoo Ignoring robots.txt

Anyone know how to get Yahoo....

         

marketingmagic

5:09 pm on Nov 28, 2005 (gmt 0)

10+ Year Member



Hi - Anyone know how to get Yahoo to obey the robots.txt?

We run a site with an american .com and a canadian .ca version of the site. Since it's the same content, etc.. and we don't want to get the duplicate content gods angry we've put a ban on all crawlers for the .ca site. It would appear that Yahoo's not obeying this and has our .ca site ranking for some obscure terms.

The file has been up for about 3-4 months now, so anything that was there has had plently of time to go away.

Does anyone have any suggestions for dealing with Yahoo not obeying this file? They claim to on their site, but the listings in the index say otherwise.

jdMorgan

8:23 pm on Nov 29, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



How did you 'put a ban on' slurp from .ca domains?

And in what form are your .ca URLs showing up in Yahoo search?

If you Disallowed them with robots.txt, and your .ca pages are showing up as URL-only listings, (possibly using link-text as the title) then that's to be expected, since they can include your URLs in their index by following incoming links (and the link text associated with those links) without actually fetching your pages.

There's also the possibility that you've got a syntax problem in your robots.txt file, but we can't be sure without a posted sample.

Jim