Forum Moderators: phranque

Message Too Old, No Replies

What do web hits look like from users with Google / Amazon hardware?

Devices like Alexa, google assistant, etc

         

SumGuy

2:43 am on Jun 23, 2020 (gmt 0)

5+ Year Member Top Contributors Of The Month



I'm pretty militant when it comes to looking through my web log files and blocking organizations and net-ranges (by IP) from being able to reach my web server. They give themselves away by looking for non existent files (ie wordpress) or they are just scraping my site from cloud hosters, etc.

When it comes to google, I block IP ranges that come back as "google user content". When it comes to Amazon, I block all of Amazon unless I need to keep a small range something open (like letsencrypt).

Because I don't own a cell phone and hence don't own or use devices like Alexa and Assistant (and I guess maybe Apple has something similar?) I have no idea if these devices intercept all web traffic between users and web-sites and maybe some of the traffic I'm blocking are people trying to hit my site through Alexa or Assistant and they don't know or can't bypass the device and just hit me directly? I'm just speculating here because I don't know how these devices operate at the home network / routing level. Otherwise all the google user content hits and AWS hits i see (or rather, that I block) is just junk?

dstiles

8:42 am on Jun 23, 2020 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



I block all google ranges except for the crawl range (66.249/16) and all amazon ranges with a few small, individual-ip holes for duckduckgo. There may be good bots on one of the blocked ranges but so far I haven't noticed one.

I use letsencrypt with this setup and haven't had a problem; I didn't thing they operated from google or amazon ranges.

Only thing I've had to occassionally drll holes for has been SOME aspects of the HTTPS validator suite Mozilla Observatory.