Forum Moderated by: open

Crawler, Spider, and User Agent ID


Forum to identify search engine spiders and user agents

 
Thread SubjectMessagesStarted byLast Message
User-Agent: Custom-AsyncHttpClient
(asked for a lot of crap)
3 SumGuy 4:43 am Sep 9, 2024
Four-Minute Bytes
Dive with me into the AWS weeds for a moment...
3 Pfui 12:58 am Sep 6, 2024
meta-externalagent
externalhotlink by another name?
2 lucy24 9:42 pm Sep 5, 2024
Redundant \ bot
3 not2easy 9:13 pm Sep 1, 2024
Yet another "Cloud" Server with rogue bots
140.248.0.0/16
4 Bewenched 2:16 am Aug 30, 2024
Why is it that when I see GET /.env or POST /index.html
It's always from a Microsoft Azure IP?
1 SumGuy 12:46 am Aug 30, 2024
Nextcloud Server Crawler
Seen today, a rare sighting
1 SumGuy 11:45 pm Aug 18, 2024
mozilla
What will they think of next?
9 lucy24 11:42 am Aug 13, 2024
PerplexityBot
Perplexing
5 Pfui 6:39 am Aug 12, 2024
PS Daily
1 lucy24 9:25 pm Aug 11, 2024
ChatGPT / openai bot
First time I'm seeing them
3 SumGuy 7:51 am Aug 10, 2024
Anyone else seeing a lot more of Applebot?
Since the beginning of 2024, Applebot has been crawling my sites more often
19 sudo 11:55 pm Aug 6, 2024
Will Googlebot cache JS after crawling it?
5 Jean_Niu 2:55 am Jul 16, 2024
Facebook hitting my server from new IP's
Why exactly?
2 SumGuy 5:16 pm Jul 12, 2024
Seeing googlebot hits from new IP range
192.178.6.x - new to me
5 SumGuy 8:25 pm Jul 6, 2024
OpenWebSearch.eu Crawler OWLer
7 engine 12:12 pm Jun 29, 2024
Redirection error
7 chainazo 9:19 pm Jun 11, 2024
A few Android user-agents I'm blocking, and why
from google and microsoft IP's
1 SumGuy 1:49 pm Jun 1, 2024
New duckduck bot
1 dstiles 8:04 am Jun 1, 2024
New bot or crawler with UA containing BW/1.2; rb.gy/oupwis
From a google cloud IP
6 SumGuy 3:30 pm May 31, 2024
Looks like Nvidia is scraping web content
3 SumGuy 11:05 pm May 28, 2024
Using Sec- to block scrapers
Sec-Fetch and Sec-Ch-Ua
29 dstiles 5:17 pm Apr 21, 2024
Server Farms 2024
Continuing discussion of hosting and data center IP ranges
7 not2easy 3:39 pm Apr 21, 2024
Digital Ocean has new IP's
6 SumGuy 2:55 pm Apr 14, 2024
Seeing binance dot com showing up as a referrer
UA is always Chrome/90.0.4430.85
2 SumGuy 2:01 am Apr 12, 2024