homepage Welcome to WebmasterWorld Guest from 54.211.219.178
register, free tools, login, search, pro membership, help, library, announcements, recent posts, open posts,
Become a Pro Member

Home / Forums Index / Search Engines / Search Engine Spider and User Agent Identification
Forum Library, Charter, Moderators: Ocean10000 & incrediBILL

Search Engine Spider and User Agent Identification Forum

    
207.253.45.203 Java1.3.0
jeremy goodrich




msg:396881
 3:13 pm on May 9, 2001 (gmt 0)

What/who is this? Got spidered a lot by this ip/ ua recently, and I'm wondering if there is any purpose to it.

Thanks in advance. Whois info follows:

[Query: 207.253.45.203, Server: whois.arin.net]

Le Groupe Videotron, Services PC (NETBLK-LGVL-207)
2155 boul. Pie-IX
Montreal, QC H1V2E4
CA

Netname: LGVL-207
Netblock: 207.253.0.0 - 207.253.255.255
Maintainer: LGVL

Coordinator:
VTL, Network Administrators (NAV1-ARIN) Net-Admin@VIDEOTRON.NET
514-899-8448 (FAX) 514-899-8452

Domain System inverse mapping provided by:

DNS1.VIDEOTRON.NET205.151.222.250
DNS2.VIDEOTRON.NET205.151.222.251

 

littleman




msg:396882
 7:30 pm on May 9, 2001 (gmt 0)

207.253.45.194 -> [netvention.com...]
These are also active via http:
207.253.45.195
207.253.45.196
207.253.45.197
207.253.45.198

If you do a tracerout on any of the above IPs or 207.253.45.203 they all fallow the same pattern. It looks like that is their bot's IP.
It's a B2B engine.

msgraph




msg:396883
 7:47 pm on May 9, 2001 (gmt 0)

I've had the same UA hit my sites from Washington State U. In my case it seems that someone in their Comp. Sci. Dept. was doing a class project out of one of their labs.

There must be some sort of Java-based program out there that all these people are using. I have yet to find which one it is.

Believe it or not I have also seen a few Altavista spiders with this same UA as well. It was just a temporary thing but they have used it.

jeremy goodrich




msg:396884
 8:02 pm on May 9, 2001 (gmt 0)

That was what was irritating me. I recalled some university using it, but not which. It was that or a research institution like IBM. Washington U, hu? Do you happen to know if they have posted about their project before I go digging around their pages?

Last time I got one of their spiders, it was libwww/version#. Now it's this java thing. Makes one wonder if they are really doing something, or just experimenting with different types of technology.

msgraph




msg:396885
 8:42 pm on May 9, 2001 (gmt 0)

Luckily the DNS had computer_science division or something. I didn't mind them grabbing the stuff if it was for some research but after time I was getting sick of them eating up bandwidth. I just went to their CS Dept. page on their site and asked the Head Techie if they were doing some sort of research.

He told me it was coming out of a lab classroom and that he would look into it. After he replied it stopped for a few days and then started back up again. It was like clockwork, everyday from 1-3 p.m. So, remembering my old college days, I assumed it was some student using their high-speed connection to grab a bunch of stuff. If it was a researcher I imagine that they would dedicate a computer and let it run day and night. But who knows.

Since it was comming out of a CS I lab, I just went ahead and banned their IP.

>>Last time I got one of their spiders, it was libwww/version#. Now it's this java thing. Makes one wonder if they are really doing something, or just experimenting with different types of technology.

Could be in my case. That's probably how the guys that made Google started out.

I don't think there is anything to worry about unless they are grabbing a lot of pages like a bot and where they are coming from seems suspicious. It is probably some open source page downloading tool/web browser out there that spits out this UA when it's used.

If there are any Java2 SDK developers or Java platform user out there I'm sure they can give us a better insight

MaliciousDan




msg:396886
 1:17 pm on May 10, 2001 (gmt 0)

Java 1.3.0 is the default useragent that the java runtime uses for URLConnections, it could be anything from a browser or spider to a horribly failed class project. It's probably not a professional anything though or at least at this point it's far from it otherwise they would have set a UA of some kind.

msgraph




msg:396887
 1:30 pm on May 10, 2001 (gmt 0)

>>horribly failed class project.

I hope it failed, they deserved it in my view.

Thanks Dan.

Everyman




msg:396888
 11:48 pm on May 10, 2001 (gmt 0)

On April 17, I had a bot with a user agent of
Java1.1.8 coming from cache2.gw.utexas.edu that
was trying a new GET at a rate of 13 times per
second. I detected them on the fly and cut them
off from my cgi-bin, but they went on to suck up
a couple hundered static pages.

I complained to abuse@utexas.edu and sent them
the zipped log, but never got an answer.

Global Options:
 top home search open messages active posts  
 

Home / Forums Index / Search Engines / Search Engine Spider and User Agent Identification
rss feed

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
Home ¦ Free Tools ¦ Terms of Service ¦ Privacy Policy ¦ Report Problem ¦ About ¦ Library ¦ Newsletter
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved