Forum Moderators: DixonJones

Message Too Old, No Replies

IXE Crawler

NO robots.txt

         

pendanticist

8:55 am on Jan 4, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



131.114.3.*** - - [04/Jan/2004:00:04:13 -0800] "GET / HTTP/1.1" 200 20402 "-" "IXE Crawler"

Search results for: 131.114.3.***

OrgName: CNR - Istituto CNUCE
OrgID: CIC-1
Address: Instituto del Consiglio Nazionale Delle Ricerche
Address: via Santa Maria 36
Address: 56100 Pisa
City:
StateProv:
PostalCode:
Country: IT

NetRange: 131.114.0.* - 131.114.255.***
CIDR: 131.114.0.0/16
NetName: PISA-NET
NetHandle: NET-131-114-0-0-1
Parent: NET-131-0-0-0-0
NetType: Direct Assignment
NameServer: SERRA.UNIPI.IT
NameServer: NAMESERVER.CNR.IT
NameServer: SIMON.CS.CORNELL.EDU
NameServer: NS1.SURFNET.NL
Comment:
RegDate: 1988-11-15
Updated: 1997-03-06

TechHandle: SS4883-ARIN
TechName: Suin, Stefano
TechPhone: +39 50 24066
TechEmail: stefano'at'unipi.it

# ARIN WHOIS database, last updated 2004-01-03 19:15
# Enter? for additional hints on searching ARIN's WHOIS database.

Doing just an exact phrase: IXE Crawler [google.com] there were some interesting hits. The one at the top is a .pdf file which explains the historical chronology of this spider.

64 pages of text twisted sideways was too much to endure, so I read partway down until I saw it mentions this bots 'Unleashing' as sometime in '02.

I've never seen this one come calling before and am intersted in finding out more on it. For now though this puppy is banned just on principle.

Let's see. That's makes two newly ressurected bots found perusing my domain today alone. And, with this one emminating from an Italian 'Institution' and Pita [webmasterworld.com] coming in from UCLA earlier, I'm beginning to wonder if some of these educational institutions aren't doing some International collaborations.

Wonder what for...

Surely they can't be that inane.

pendanticist

5:26 pm on Feb 9, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



...

bull

4:07 pm on Feb 25, 2004 (gmt 0)

10+ Year Member



Was here today, fetched robots.txt though. Same IP block.
Disallowed her now, let's see how she behaves.

webvaccrawler

7:55 pm on Apr 1, 2004 (gmt 0)

10+ Year Member



WebVac is from open source code that previously
identified itself as Pita.
UCLA has picked up that code and did not change the
name.
I have contacted them to change their name
and they have agreed.
When they tell me the name, I will repost.