Forum Moderators: open
InfociousBot (+http://corp.infocious.com/tech_crawler.php)
First requested the robots.txt from a site wich was on a subdomain. This site has been offline for quite some time and has been returning 404's eversince.
The robots.txt has a complete exclude using:
User-agent: *
Disallow: /
yet, they happily continued requesting pages. About 50 of them, wich ofcourse all resulted in a 404.
All happened in about 2 minutes time.
Have fun banning them :)
I know I did :)