It did download robots.txt before going ahead, but its user-agent string has no URL or other contact info, and I can't find anything more about it at cluster.ischool.washington.edu, which is where the IPs belong.
I decided to deny it access until I learn more about it. This is from the logs (just one site; it's accessing several sites on different IPs, so it must have ample bandwidth, or it really, really likes me ;-)
128.X08.131.220 - - [07/May/2004:18:54:45 +0300] "GET /****.html HTTP/1.1" 403 235 "-" "WIRE/0.1 (Linux; i686; Bot,Robot,Spider,Crawler)"
128.X08.131.220 - - [07/May/2004:18:54:50 +0300] "GET /****.html HTTP/1.1" 403 235 "-" "WIRE/0.1 (Linux; i686; Bot,Robot,Spider,Crawler)"
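For anyone who wants to do the same, this is roughly how those 403s get served. A minimal sketch for Apache with mod_setenvif (the "WIRE/" token comes from the logs above; the env variable name is just an example):

# deny any request whose User-Agent starts with "WIRE/"
SetEnvIfNoCase User-Agent "^WIRE/" block_wire
Order Allow,Deny
Allow from all
Deny from env=block_wire

Since it does fetch robots.txt first, a "User-agent: WIRE" / "Disallow: /" record ought to keep it out too, assuming it honors what it reads.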
[edited by: DaveAtIFG at 7:40 pm (utc) on May 7, 2004]
[edit reason] Obscured IP [/edit]
Their bot has already downloaded over 10,000 documents from several websites of mine, on different IPs, hosted in Greece and in the US, in 6 different languages.
What's the point?
I found info about the WIRE bot here:
[cwr.cl...]
And I guess those people at UWashington have lots of spare bandwidth to waste...
Yep, robot programmers who don't hide usually appreciate feedback; they need it to improve their bots and to fix bugs.
(Thanks to feedback I sent to the IBM Almaden developer team a few years ago, their bot finally learned how to parse Macintosh-formatted robots.txt files, with their non-standard line endings: bare CR rather than LF or CR/LF.)
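For anyone writing their own crawler, the fix is cheap. A minimal sketch in Python (the function name and URL handling are illustrative, not from any particular bot): split robots.txt with a line-ending-tolerant routine so CR-only Macintosh files parse the same as LF or CR/LF ones.

import urllib.request

def read_robots_lines(base_url):
    # str.splitlines() recognizes LF, CR/LF, and bare CR (old
    # Macintosh files); a naive text.split("\n") is exactly the
    # kind of bug described above.
    raw = urllib.request.urlopen(base_url + "/robots.txt").read()
    text = raw.decode("utf-8", errors="replace")
    lines = []
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop comments and whitespace
        if line:
            lines.append(line)                # keep only meaningful lines
    return lines

read_robots_lines("http://example.com") then yields clean "User-agent:" / "Disallow:" lines no matter which line-ending convention the file uses.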
Yep. The software is free and available from the Center for Web Research at the University of Chile. The requests are obviously coming from the University of Washington Information School, where they run several projects related to web search. The most relevant is the Web Tango Project [webtango.ischool.washington.edu], a research project that includes designing interfaces for enhancing Internet searching ...