Forum Moderated by: open

Crawler, Spider, and User Agent ID


Forum to identify search engine spiders and user agents

 
Thread Subject | Messages | Started by | Last Message
  Identifying bots via ip and host
30 enigma1 5:20 pm Nov 29, 2008
  CFNetwork again.
More junk from Yahoo?
7 GaryK 7:31 pm Nov 28, 2008
  User Agent = Feedfetcher-Google
17 smallcompany 5:02 pm Nov 28, 2008
  GurujiBot/1.0
Obeyed Robots.txt
3 mcneely 2:31 am Nov 28, 2008
  DomainCrawler
What purpose does it serve?
11 GaryK 9:30 pm Nov 26, 2008
  UA's starting with "User-Agent:"
Redundant UA declaration from MSIE look-alikes
4 caribguy 11:17 am Nov 25, 2008
  SkyGrid
2 keyplyr 2:12 am Nov 25, 2008
  Voracious
2 keyplyr 11:13 am Nov 24, 2008
  BTWebClient
5 koan 5:00 pm Nov 23, 2008
  Personifi heads up
HEAD request with Wget/1.10.2 and then GET with Mozilla/5.0 X11
8 caribguy 8:52 pm Nov 21, 2008
  More netsweeper.
I feel the urge to 86 216.171.96.nnn - 216.171.111.nnn
3 caribguy 8:25 pm Nov 19, 2008
  Interesting UA
13 wilderness 5:13 pm Nov 18, 2008
  Strange new "no-referrer" traffic
16 Scarecrow 4:05 pm Nov 18, 2008
  Mozilla/5.0 (compatible; OpenX Spider; http://www.openx.org)
Posed as this then crawled as Nutch
9 GaryK 5:43 pm Nov 17, 2008
  WebTV
Ballmer's rival for YouTube?
6 Samizdata 3:51 am Nov 17, 2008
  Bots Attack Using Randomized User Agents
Simply Blocking Libwww-PERL Won't Work
12 incrediBILL 1:06 am Nov 17, 2008
  1813
how you're doing with it today?
4 smallcompany 10:38 pm Nov 16, 2008
  BeB-cart
Why is a shopping cart crawling my site?
3 GaryK 5:33 pm Nov 15, 2008
  lnbot
FAST Enterprise Crawler 6 used by LexisNexis
4 caribguy 2:19 pm Nov 12, 2008
  Japanese AV bot?
3 Megaclinium 9:02 pm Nov 10, 2008
  copyright sheriff
Another rights enforcer?
3 GaryK 10:59 pm Nov 9, 2008
  is this Googlebot legit?
"Mozilla/5.0 (compatible; Googlebot/2.1; http://www.google.com/bot.html)"
6 Nkona 7:31 pm Nov 9, 2008
  Mozilla/5.0 (compatible; Google Keyword Tool; +https://adwords.google.
Anyone see this bot crawling lately
14 trinorthlighting 3:15 pm Nov 9, 2008
  Fake Googlebot from SoftLayer
creepy creepy crawly crawly
26 Samizdata 11:36 pm Nov 8, 2008
  Google Home Page as Referrer and Random User Agents
Is this a scraper?
9 dataguy 9:00 pm Nov 8, 2008