Forum Moderated by: open

Crawler, Spider, and User Agent ID


Forum to identify search engine spiders and user agents

 
Thread SubjectMessagesStarted byLast Message
  Bing/1.1 CFNetwork/459
3 wilderness 6:38 pm May 24, 2010
  Deluged with unknown bots
19 montclairguy 5:56 am May 23, 2010
  Unknown bot scrape or probe?
6 enigma1 8:05 am May 18, 2010
  Googlebot now hiding its UA
4 enigma1 3:18 pm May 17, 2010
  New archive.org UA
includes heritrix :)
13 caribguy 2:52 am May 17, 2010
  .b2b3a
7 wilderness 8:59 pm May 14, 2010
  Missing HTTP HOST
No value at all in HTTP_HOST
9 dstiles 8:30 pm May 14, 2010
  80legs
[3] ( 1 2 3 )
61 GaryK 7:18 am May 12, 2010
  Proxy IT
2 keyplyr 5:41 pm May 11, 2010
  Twitter Chasing Bots
How many bots are chasing your tweets?
10 incrediBILL 5:52 am May 9, 2010
  SERPAnalytics
yet another SEO scraper
4 incrediBILL 10:25 pm May 8, 2010
  SmartViper
another SEO scraper
5 incrediBILL 12:43 am May 8, 2010
  TweetmemeBot
2 incrediBILL 10:28 pm May 7, 2010
  80legs on the crawl
5 incrediBILL 6:15 am May 6, 2010
  Strange Requests from Googlebot
9 aristotle 8:23 pm May 5, 2010
  downforeveryoneorjustme Revisited
(incl. AppEngine-Google)
4 Pfui 12:24 am May 5, 2010
  Blocking Cuil bots
8 Asia_Expat 4:40 am May 4, 2010
  Apple iPad UA ID
3 Pfui 5:09 am Apr 28, 2010
  MSN's many cloaked bots.
Mass undocumented activity in search.msn.com ranges[2] ( 1 2 )
42 Pfui 8:28 am Apr 27, 2010
  Protoype
Cloaked somethingorother from .us.ibm.com
5 Pfui 3:09 pm Apr 26, 2010
  Interesting Google-bot Image Encounter
3 caribguy 3:08 pm Apr 25, 2010
  Blank User agent string
Blank User agent string query
4 baiwan 6:50 pm Apr 24, 2010
  Yellow Pages (heritrix)
2 Pfui 9:48 pm Apr 21, 2010
  blocking user agents - 403
UA continues request
5 smallcompany 3:04 am Apr 21, 2010
  Anyone know the Basefarm user agent name?
3 internetheaven 9:22 pm Apr 20, 2010