Forum Moderators: open

Message Too Old, No Replies

Yahoo-MMCrawler/3.x

and robots.txt

         

bull

6:36 am on Oct 7, 2004 (gmt 0)

10+ Year Member




User-agent: Yahoo-MMCrawler
Disallow: /

User-agent: MMCrawler
Disallow: /

User-agent: Yahoo-MMAudVid
Disallow: /

does not help - the bot does not even read robots.txt itself. I am now using .htaccess to keep it out. What is the correct entry in robots.txt?

This behaviour is a shame for a major search company like Yahoo.

WebGuerrilla

1:29 am on Oct 8, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member




Have you verified that the IP it's coming from is indeed owned by Yahoo?

It's not uncommon for rogue bots to run around using other bots UA's.

abates

1:45 am on Oct 8, 2004 (gmt 0)

10+ Year Member



Yahoo's help pages say:

User-agent: Yahoo-MMCrawler

bull

4:50 pm on Oct 9, 2004 (gmt 0)

10+ Year Member



Have you verified that the IP it's coming from is indeed owned by Yahoo?

Of course - 66.94.233.85 (please edit if deprecated). It is even not uncommon that Yahoo bots don*t check robots.txt with their proper User-agent, as confirmed in forum11 by that Yahoo guy who is around here. I have enabled the MMCrawler to get robots.txt, but nothing else - why should I write e-mails to the indicated address for a non robots.txt compliant bot?

Regards from Rome