Master List of Malicious Robots

Is there an active list of the bots we should block?

Eric_Lander

8:26 pm on Aug 11, 2004 (gmt 0)

10+ Year Member



I'm looking for a good resource that maintains an up-to-date list of bad robots. I'm working on a large dynamic site with lots of images and content, and I know that if the wrong bots come through, they could cause a lot of harm to the site owners.

Does such a list exist? I used to use an application that was kept up to date, but the creators have since gone out of business.

DaveAtIFG

5:20 pm on Aug 12, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Here's a looooonnnng discussion [webmasterworld.com] that includes a pretty comprehensive list. And our Search Engine Spider Identification [webmasterworld.com] discusses individual crawlers in detail.

Eric_Lander

11:19 am on Aug 13, 2004 (gmt 0)

10+ Year Member



Very cool Dave... Much thanks!

helohelo

3:40 pm on Aug 25, 2004 (gmt 0)

10+ Year Member



Hmm, I don't understand: is this list meant for .htaccess, or can it also be used for robots.txt?

And how malicious can they be? When does a robot end up on this list?

Dreamquick

4:09 pm on Aug 25, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



A truly malicious robot will just ignore everything you have in robots.txt, so no, you can't stop all malicious robots using robots.txt. You need to use some kind of logic or check to catch them.

What robots.txt does allow you to do is exclude the nuisance/semi-legitimate 'bots that are willing to follow your instructions.
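
To make the distinction concrete, here is a minimal Python sketch, not anyone's production setup. The robots.txt rules, the bot names "NuisanceBot" and "BadScraper", and both helper functions are made up for illustration. The first function shows what a well-behaved bot would conclude from robots.txt; the second stands in for the kind of check the site itself has to run, since nothing forces a bot to read robots.txt at all.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content; "NuisanceBot" is a made-up name.
ROBOTS_TXT = """\
User-agent: NuisanceBot
Disallow: /

User-agent: *
Disallow: /images/
"""


def polite_bot_may_fetch(user_agent: str, path: str) -> bool:
    """What a bot that chooses to honor robots.txt would decide for itself."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(user_agent, path)


# Hypothetical blocklist of User-Agent substrings, checked by the site itself.
BLOCKED_AGENT_SUBSTRINGS = ("nuisancebot", "badscraper")


def server_should_block(user_agent: str) -> bool:
    """Case-insensitive substring match against the blocklist above."""
    ua = user_agent.lower()
    return any(name in ua for name in BLOCKED_AGENT_SUBSTRINGS)
```

Keep in mind that the User-Agent header is supplied by the client and can be faked, so a name match like this only catches bots that identify themselves honestly. It is one possible "logic or check", not a complete defense.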

- Tony