Forum Moderators: open

Message Too Old, No Replies

Gotta love this one from Amazon

They'll stop crawling depending on my attitude!

         

GaryK

8:02 am on Sep 28, 2008 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



AISearchBot (Email: aisearchbot@gmail.com; We love searching and we'll stop crawling immediately depending on your attitude.)
75.101.240.68
ec2-75-101-240-68.compute-1.amazonaws.com

I wonder what my attitude should be considering they didn't read robots.txt and tried to crawl non-existent files? Hmmm... They'll be getting an e-mail with bad attitude from me and then I'll force them to stop crawling my sites, :).

jdMorgan

7:52 pm on Sep 29, 2008 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



This is Amazon's "rent additional servers and high-performance internet connectivity by the day, week, or month to suit your immediate server performance needs" service; They term it their "compute cloud."

The problem is that *anyone* can potentially use it for *anything*, including crawlers, and that rDNS is generally not provided. I've seen what appear to be legitimate robots from this IP range, so they must be renting additional performance from Amazon for crawling. Unfortunately, since there is no rDNS, I give them the boot. I hope that they'll come back later with an IP range that resolves, but if I drop out of their indexes because I blocked them, then this becomes an 'unsolvable' problem: Too many junk requests to allow non-rDNS-validated robot access, and this hosting model apparently does not support rDNS... :(

Jim

GaryK

9:49 pm on Sep 29, 2008 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I've seen what appear to be legitimate robots from this IP range, so they must be renting additional performance from Amazon for crawling.

I'm not saying this is a legit bot, but I have seen it in my logs before and it wasn't using Amazon so it seems you're right.

I wish Amazon cared about what things like this do to their good name more than they seem to care only about the bottom line.

Oh well, I've got more important things to worry about right now like today's financial bloodbath.

incrediBILL

2:32 am on Oct 1, 2008 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



I wholesale block anything coming from amazonaws.com so they found a bad attitude on their first attempt.

I'll let them crawl depending on *MY* attitude! :)

[edited by: incrediBILL at 2:32 am (utc) on Oct. 1, 2008]