Forum Moderated by: open

Crawler, Spider, and User Agent ID


Forum to identify search engine spiders and user agents

 
Thread SubjectMessagesStarted byLast Message
Defining good bad bots
4 smallcompany 7:08 am Feb 29, 2008
Home brewed spider?
216.237.31.*** scraping my site
6 Megaclinium 2:17 am Feb 29, 2008
spider analysis software
3 gdawg 12:31 am Feb 28, 2008
Exalead = Abovenet Communications, Inc.
193.47.80.xx and now 64.124.148.xx
5 Bewenched 11:34 pm Feb 26, 2008
Rogue bot from microsoft
Msrbot
5 marodhum 7:35 pm Feb 25, 2008
Baidu Image Spider
Is it valid?
5 GaryK 11:02 am Feb 24, 2008
Teemer from NetSeer
20 keyplyr 10:00 am Feb 24, 2008
Return of Yahoo! Slurp/3.0
Last seen from Inktomi
6 jdMorgan 1:42 pm Feb 23, 2008
google tried to load ./hack.com
reason for trying hack.com within the site
4 smallcompany 5:07 pm Feb 22, 2008
Yahoo now violating robots.txt
... and heading for a complete ban from our sites
20 Mokita 3:54 am Feb 21, 2008
military IP and php attack
6 smallcompany 4:18 pm Feb 20, 2008
Yahoo Java Crawler and Mod Security
yahoo mod security and
10 frontpage 3:12 pm Feb 15, 2008
What Google tool
3 wilderness 2:56 pm Feb 13, 2008
208.111.154.**
Kavam / Limelightnetworks
8 Bewenched 3:08 am Feb 13, 2008
Naverbot
good or bad
5 Bewenched 10:28 pm Feb 12, 2008
DataCha0s
A little more info...
2 mcneely 5:31 pm Feb 12, 2008
VadixBot
2 Hobbs 5:28 pm Feb 12, 2008
Newbie question
**.nat.svl.kavam.net , spider indentification
2 jim_knopf 5:26 pm Feb 12, 2008
Strange?inx=<URL> page requests
Any ideas?
2 ofnimira 5:23 pm Feb 12, 2008
Heritrix
sending my own domain as their referral?!
2 Bewenched 5:19 pm Feb 12, 2008
Blackspider?
recently showed up
12 mcneely 2:01 pm Feb 9, 2008
Blue Communication AS
Majestic-12?
15 Bewenched 1:36 am Feb 9, 2008
MultiCrawler
Doesn't obey robots.txt
13 Mokita 3:02 pm Feb 6, 2008
Are there "bot networks"?
Controlled by the same person or company
10 Reno 5:49 pm Feb 5, 2008
Getting traffic from llnw.net
2 JohnKelly 3:58 am Feb 4, 2008