Forum Moderators: DixonJones
69.28.130.*** - - [06/Dec/2003:17:06:54 -0800] "GET /robots.txt HTTP/1.1" 200 1524 "-" "QPCreep Test Rig ( We are not indexing, just testing )"
69.28.130.*** - - [06/Dec/2003:17:06:54 -0800] "GET / HTTP/1.1" 403 480 "-" "QPCreep Test Rig ( We are not indexing, just testing )" Had this one previously banned on a hunch that it's a variant of QuepasaCreep [google.com].
We are not indexing, just testing...for whom?
Any ideas?
Pendanticist.
Otoh, same thread has it on another IP belonging to the SE on [quepasa.com...] ... and some other IP's as well. I don't really know what to think of it, as that text is clearly not in Spanish, although it's probably intended to offer a little comfort to worried webmasters.
/claus
Host: 69.28.130.*** Url: / Http Code : 200
Date: Dec 07 02:41:56 Http Version: HTTP/1.1 Size in Bytes: 11276
Referer: - Agent: QPCreep Test Rig ( We are not indexing, just testing ) /<THANKS>/
I checked on Arin and it gave this:
OrgName: Limelight Networks, LLC
OrgID: LLNW
Address: 8936 North Central Avenue
City: Phoenix
StateProv: AZ
Cheers,
Goober
69.28.130.*** - - [08/Dec/2003:09:40:25 +0000] "GET /tmp/konq.png HTTP/1.1" 404 3876 "-" "QPCreep Test Rig ( We are not indexing, just testing )" 0 dorward.me.uk
69.28.130.*** - - [08/Dec/2003:10:03:39 +0000] "GET /robots.txt HTTP/1.1" 200 84 "-" "QPCreep Test Rig ( We are not indexing, just testing )" 0 dorward.me.uk
69.28.130.*** - - [08/Dec/2003:10:03:39 +0000] "GET /foaf.rdf HTTP/1.1" 200 9295 "-" "QPCreep Test Rig ( We are not indexing, just testing )" 0 dorward.me.uk
... and its ignoring my robots.txt
User-agent: *
Disallow: /tmp/
Disallow: /images/
Disallow: /notes/
Disallow: /lib/
NSLOOKUP does not resolve but a WHOIS gives me:
[Query: 69.28.130.229, Server: whois.arin.net]
OrgName: Limelight Networks, LLC
OrgID: LLNW
Address: 8936 North Central Avenue
City: Phoenix
StateProv: AZ
PostalCode: 85020
Country: US
ReferralServer: rwhois://rwhois.llnw.net:4321/
NetRange: 69.28.128.0 - 69.28.191.255
CIDR: 69.28.128.0/18
NetName: LLNW-1
NetHandle: NET-69-28-128-0-1
Parent: NET-69-0-0-0-0
NetType: Direct Allocation
NameServer: DNS.LAX.LLNS.NET
NameServer: DNS.LGA.LLNS.NET
NameServer: DNS.SJC.LLNS.NET
NameServer: DNS.IAD.LLNS.NET
Comment: Network reassignments available via
Comment: rwhois.llnw.net 4321
RegDate: 2003-03-07
Updated: 2003-07-09
OrgAbuseHandle: WP215-ARIN
OrgAbuseName: Petrisko, William
OrgAbusePhone: +1-602-850-5095
OrgAbuseEmail: ipadmin@limelightnetworks.com
OrgTechHandle: WP5-ARIN
OrgTechName: Petrisko, William
OrgTechPhone: +1-602-850-3089
OrgTechEmail: billp@wjp.net
# ARIN WHOIS database, last updated 2003-12-08 19:15
# Enter? for additional hints on searching ARIN's WHOIS database.
[End of Data]
llns.net looks like a hosting provider so this could easily be anything at this point. I'm not going to block it yet but I'm interested to find out what it is.
The worst part about their robot, besides ignoring robots.txt, is that the thing is just plain dumb. I receive about 4 hits a day from it and it invariable gets 3 404's because the thing tries to read .htm files even though all my pages and internal links are .html.
I'd say their testing is going terribly.
I think it's great that they are trying to do real world testing but if this test is sucking up our bandwidth uselessly then why should we help?
Where should the line be drawn exactly because there will be more of this in the future.
69.28.130.*** - - [19/Dec/2003:18:19:11 -0800] "GET /robots.txt HTTP/1.1" 200 1524 "-" "QuepasaCreep ( crawler'at'quepasacorp.com )"
69.28.130.*** - - [19/Dec/2003:18:19:11 -0800] "GET / HTTP/1.1" 403 480 "-" "QuepasaCreep ( crawler'at'[red]quepasacorp.com[/red] )" As you can see, I've got it banned. However, you'll also note they seem to have changed their UA to include a contact addy.
Perhaps they're set to begin?
On the other hand:
[search.msn.com...]
Pendanticist.
69.28.130.*** - - [19/Dec/2003:14:11:11 -0800] "GET /robots.txt HTTP/1.1" 200 734 "-" "QuepasaCreep ( crawler@quepasacorp.com )"
So h*tp://www.quepasa.com isn't who they are?
NetRange: 69.28.128.0 - 69.28.191.255
CIDR: 69.28.128.0/18
On the other hand has anyone been hit by these:
"Mozilla/4.0 compatible ZyBorg/1.0 Dead Link Checker (wn.zyborg@looksmart.net; [WISEnutbot.com)"...]
"SearchGuild_DMOZ_Experiment chris@searchguild.com"
"Exalead NG/MimeLive Client (convert/http/0.147)"
"Szukacz/1.5 (robot; www.szukacz.pl/jakdzialarobot.html; info@szukacz.pl)"
"Xenu Link Sleuth 1.2e"
"sitecheck.internetseer.com (For more info see: [sitecheck.internetseer.com)"...] (Never even requested there service)
"http://www.almaden.ibm.com/cs/crawler [c01]"