Forum Moderated by: open

Crawler, Spider, and User Agent ID


Forum to identify search engine spiders and user agents

 
Thread SubjectMessagesStarted byLast Message
Server Farms - July 2018
Continuing discussion of hosting company IP ranges[5] ( 1 2 3 4 5 )
139 keyplyr 5:54 pm Jan 17, 2020
Info.com crawler
3 ClosedForLunch 8:55 pm Jan 10, 2020
UnChaos
does not follow robots.txt
2 notriddle 6:14 pm Jan 9, 2020
fake googlebot?
from a comcast IP address
3 SumGuy 1:26 am Jan 9, 2020
RDDocuments
2 Pfui 6:17 am Jan 8, 2020
BuiltWith
6 lucy24 2:24 am Dec 29, 2019
Status 101
2 tangor 8:40 pm Dec 28, 2019
woorank
2 lucy24 2:29 am Dec 10, 2019
Corax
3 tangor 6:02 pm Dec 7, 2019
Bot Mashing
5 rivsrush 2:00 pm Dec 6, 2019
Linespider
9 lucy24 11:16 am Nov 21, 2019
Any humans come from there?
*.compute.hwclouds-dns.com
4 blend27 6:02 am Nov 17, 2019
ApiTool
7 Pfui 2:26 am Nov 17, 2019
Python, Curl and Robots.txt
17 dstiles 8:48 pm Nov 12, 2019
whisper
14 lucy24 10:00 am Oct 20, 2019
Return / continuation of msnbot
search.msn.com/msnbot.html
11 dstiles 9:58 am Oct 20, 2019
Google hiccup
3 wilderness 9:33 am Oct 14, 2019
Hi
Pages exploit UA
15 Pfui 5:26 am Oct 5, 2019
URI: ./ ./ mnt/ custom/ ProductDefinition
DVR remote code execution
3 Pfui 5:35 pm Sep 27, 2019
Yisou again
2 lucy24 12:06 am Sep 27, 2019
GrumpyCrawler
Microsoft range
3 Pfui 6:02 pm Sep 24, 2019
Mb2345Browser/9.0
I'm pretty sure this is actually a crawler, not a browser
8 notriddle 2:51 am Sep 10, 2019
serpstatbot/1.0 (advanced backlink tracking bot
3 notriddle 9:28 pm Sep 3, 2019
Moreover
4 lucy24 1:18 am Sep 2, 2019
BingPreview from Facebook
4 lucy24 8:50 pm Sep 1, 2019