Forum Moderated by: open

Crawler, Spider, and User Agent ID


Forum to identify search engine spiders and user agents

 
Thread SubjectMessagesStarted byLast Message
  ucsb seclab crawler
8 Pfui 9:02 pm Jan 22, 2012
  Vancouver 0.9
3 MxAngel 10:45 am Jan 20, 2012
  SpiderLing
2 Pfui 6:58 am Jan 18, 2012
  Gigabot Revisited
4 Pfui 5:53 am Jan 18, 2012
  obot
IBM iss.net IPs
5 Pfui 4:55 am Jan 16, 2012
  EuripBot
3 keyplyr 8:46 pm Jan 14, 2012
  MS Crawler Hiding as a Browser
14 incrediBILL 4:19 am Jan 11, 2012
  Is this a valid user agent? and how do I block it?
Is this a valid user agent? and how do I block it?
12 spiritualseo 4:59 am Jan 10, 2012
  Nook
2 Pfui 7:22 pm Jan 8, 2012
  Servers using open proxies
4 dstiles 3:21 pm Jan 7, 2012
  boia.org
tools request, ignore robots.txt
2 Pfui 7:06 am Jan 7, 2012
  Googlebot-richsnippets
2 Pfui 2:37 am Jan 7, 2012
  MFE expand
3 Pfui 12:44 am Jan 7, 2012
  Vocus
Hocus Pocus
2 Pfui 11:17 pm Jan 6, 2012
  Kindle & Amazon IPs?
28 keyplyr 8:45 pm Jan 5, 2012
  Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)
9 Pfui 8:41 pm Jan 5, 2012
  Automatically detect crawlers / bots
13 Globetrotter 3:03 pm Jan 5, 2012
  Semrushbot/0.9
5 tangor 11:12 pm Jan 4, 2012
  yolinkBot
4 keyplyr 7:06 pm Jan 3, 2012
  MSNBot has become a constant Fast-Scraper
7 IPs crawling at max 12 pages / sec - this is out of order[2] ( 1 2 )
42 AlexK 8:28 pm Jan 1, 2012
  GoogleDocs
4 Pfui 3:09 am Dec 31, 2011
  DDDD DDDD human?
8 lucy24 2:39 am Dec 29, 2011
  Packet One
2 keyplyr 5:07 pm Dec 24, 2011
  Awstats Scrapers / Injectors
19 dstiles 11:59 pm Dec 19, 2011
  New scraper? portalimage.org
9 brokaddr 11:27 am Dec 17, 2011