Is this really ia_archiver?

What is the true IA user agent?

         

bouncybunny

12:28 am on Jun 13, 2007 (gmt 0)

Something using the ia_archiver user agent is crawling dozens of pages on my site. Is this IP really from archive.org?

64.208.172.* - - [11/Jun/2007:21:51:07 -0400] "GET /directory/index.html HTTP/1.0" 301 40329 "-" "ia_archiver"

thetrasher

10:39 am on Jun 13, 2007 (gmt 0)

Yes and no.

No, this IP is not from the Internet Archive (archive.org). The range belongs to Alexa. A reverse DNS lookup should return xcrawl**.alexa.com, but rDNS seems to be misconfigured. Forward DNS works: for example, xcrawl28 resolves to an address in 64.208.172.0/24.

Yes, this is the real ia_archiver.
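
For anyone who wants to repeat the check described above, here is a minimal Python sketch. It assumes the 64.208.172.0/24 range quoted in this thread is only an example and not a complete list of crawler addresses. The function names are made up for illustration; the lookups use the standard socket module, and the resolver is passed in as a parameter so it can be swapped out.

```python
import ipaddress
import socket

# Range quoted in this thread; an example only, not a complete list.
ALEXA_RANGE = ipaddress.ip_network("64.208.172.0/24")


def in_known_range(ip, network=ALEXA_RANGE):
    """Return True if ip falls inside the given network."""
    return ipaddress.ip_address(ip) in network


def reverse_name(ip, lookup=socket.gethostbyaddr):
    """Reverse (PTR) lookup: return the hostname, or None if it fails."""
    try:
        return lookup(ip)[0]
    except OSError:
        # socket.herror and socket.gaierror are both OSError subclasses.
        return None


def forward_confirms(hostname, ip, resolve=socket.gethostbyname_ex):
    """Forward lookup: return True if hostname resolves to ip."""
    try:
        _name, _aliases, addresses = resolve(hostname)
    except OSError:
        return False
    return ip in addresses
```

When rDNS is broken, as it seems to be here, reverse_name returns None. The fallback is to call forward_confirms with a hostname you expect (for instance xcrawl28.alexa.com) and the address from your log.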

bouncybunny

11:49 am on Jun 13, 2007 (gmt 0)

Masterful. Thank you.

wilderness

4:15 pm on Jun 14, 2007 (gmt 0)

These folks crawl from more hidden IPs than almost anybody.

64.213.203.145 - - [14/Jun/2007:11:12:01 -0500] "GET /robots.txt HTTP/1.0" 403 - "-" "ia_archiver"
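
To see how many different addresses send this user agent, one option is to tally them from the access log. This is a rough sketch: the regular expression assumes the log layout of the lines quoted in this thread, and the function name clients_by_agent is made up for illustration.

```python
import re
from collections import defaultdict

# Matches the access-log layout of the lines quoted in this thread.
LOG_LINE = re.compile(
    r'^(?P<client>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<request>[^"]*)" (?P<status>\d{3}) (?P<size>\S+) '
    r'"(?P<referer>[^"]*)" "(?P<agent>[^"]*)"$'
)


def clients_by_agent(lines, agent="ia_archiver"):
    """Map each client address that sent the given user agent to its hit count."""
    hits = defaultdict(int)
    for line in lines:
        match = LOG_LINE.match(line.strip())
        # Lines that do not fit the pattern are skipped silently.
        if match and match.group("agent") == agent:
            hits[match.group("client")] += 1
    return dict(hits)
```

Pass it any iterable of log lines, such as an open file. The resulting addresses can then be compared against whatever ranges you already trust.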