Forum Moderators: open

Message Too Old, No Replies

Amazon

         

wilderness

6:00 pm on Dec 1, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Over the past few days, I'm getting an abnormal amount of activity from an Amazon IP utililizing a Java UA.

216.182.233.** - - [30/Nov/2006:20:31:23 -0800] "GET /myFolder/myPage.html HTTP/1.1" 403 - "-" "Java/1.5.0_09"

I'm not really against Amazon crawling (although the UA was denied).
I'd rather they provide a more relavant UA.

thetrasher

2:13 pm on Dec 2, 2006 (gmt 0)

10+ Year Member



It's Amazon Elastic Compute Cloud (EC2).
Amazon EC2 presents a true virtual computing environment
See also [webmasterworld.com ]

In October I saw "NutchEC2Test/Nutch-0.9-dev (Testing Nutch on Amazon EC2.; http://lucene.apache.org/nutch/bot.html; ec2test at lucene.com)" coming from 216.182.225.nnn and 216.182.236.nn.

216.182.224.0/20 belongs to Amazon's development center in South Africa. For me this IP range is worth a ban. I don't like anonymous crawling.

wilderness

3:32 pm on Dec 2, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Many thanks trasher.

An alternative to colo ;)