Forum Moderators: open

Message Too Old, No Replies

What is the name the Yahoo image bot?

or is it a seperate bot?

         

Iczer

4:08 pm on Apr 6, 2006 (gmt 0)

10+ Year Member



After finally getting out of Google images I am now all over Yahoo images. I am unable to find what bot they are using in my logs. Just wondering the name of the bot Yahoo uses so I can ban it.

Aside: I don't allow outside linking to the pictures on my site so when one clicks on the picture in Yahoo images you get an error that says "The image you are trying to view is either temporarily unavailable or may no longer exist." and then gives a menu listing of the entire directory. This is almost more annoying then having people leech the pictures and bandwidth.

wilderness

2:42 pm on Apr 7, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



This is almost more annoying then having people leech the pictures and bandwidth.

Either have your host/server configure files properly or add a blank index.html to your image folder.

Pfui

1:46 am on Apr 8, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



User-agent: Yahoo-MMCrawler

There you go, from Yahoo's Help pages for their Image Search [help.yahoo.com]:

How do I remove my copyrighted image from the results? [help.yahoo.com]

Just in case, I include these two related entries in my robots.txt --

User-agent: Yahoo-MMCrawler
User-agent: Yahoo-MMCrawler/3.x
Disallow: /

And to belt-and-suspenders your safeguards, you can also include:

User-agent: *
Disallow: /your-image-directory

Good luck!

Iczer

2:06 pm on Apr 12, 2006 (gmt 0)

10+ Year Member



Both suggestions implimented. Thanks for the help!

Tastatura

2:15 pm on Apr 12, 2006 (gmt 0)

10+ Year Member



Options suggested by wilderness and Pfui are good ones, however you might have the blank index page indexed by SEs – not sure about this. Another option is to add this to your .htaccess file:

Options All -Indexes

So any time someone requests dir they will get ‘403 Forbidden’

HTH