
Another new one!

Lickity_Split

         

carfac

6:53 pm on Sep 28, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Found this today:

24.163.40.18 - - [27/Sep/2002:23:09:37 -0600] "GET / HTTP/1.0" 200 4924 "-" "Lickity_Split+(http://www.dnnerprise.net/usp-spider.asp)"

It requested robots.txt and seemed to follow it, though that is hard to tell since it only grabbed a couple of pages.

The URL in the UA does not come back to a real page, and the IP goes back to rr.com.

From Sam Spade: Error - dnnerprise.net doesn't exist
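For anyone who wants to pull the same details out of their own logs, here is a minimal Python sketch. It assumes the line is in Apache "combined" log format, which the entry above appears to be. The names `LOG_PATTERN`, `URL_IN_AGENT`, and `parse_log_line` are made up for illustration.

```python
import re

# Assumed layout (Apache "combined" format):
# host ident user [time] "request" status bytes "referer" "user-agent"
LOG_PATTERN = re.compile(
    r'(?P<ip>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<request>[^"]*)" (?P<status>\d{3}) (?P<size>\S+) '
    r'"(?P<referer>[^"]*)" "(?P<agent>[^"]*)"'
)

# Hypothetical helper pattern: first http(s) URL embedded in a user-agent string.
URL_IN_AGENT = re.compile(r'https?://[^\s)]+')


def parse_log_line(line):
    """Return a dict of log fields, or None if the line does not match."""
    match = LOG_PATTERN.match(line)
    if match is None:
        return None
    fields = match.groupdict()
    url = URL_IN_AGENT.search(fields["agent"])
    fields["agent_url"] = url.group(0) if url else None
    return fields


line = (
    '24.163.40.18 - - [27/Sep/2002:23:09:37 -0600] "GET / HTTP/1.0" 200 4924 '
    '"-" "Lickity_Split+(http://www.dnnerprise.net/usp-spider.asp)"'
)
entry = parse_log_line(line)
print(entry["ip"], entry["agent_url"])
```

The extracted `agent_url` is what you would then paste into a browser or a whois tool, and `ip` is what you would reverse-resolve.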

dave

bird

7:01 pm on Sep 28, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



It's the robot that can't spell. Try innerprise instead.

carfac

2:10 am on Sep 29, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



bird:

Not that "i" vs. "d" helps. It is still not a real domain...

dave

jdMorgan

1:35 am on Sep 30, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Changing the "d" to an "i" in the above URL worked for me. This seems to be a variant of the USP spider with a modified user-agent string. The IP belongs to Road Runner (rr.com), so there is no telling who is actually running it.

Jim

Woz

2:00 am on Sep 30, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



innerprise.net makes a spidering program called URL Spider or URL Spider PRO for building search data. It runs from a local machine, so whoever is spidering your site may be building their own personal directory, or even a niche directory/engine to go online.

It is not a downloader so you don't have to worry on that score, but you may get some referrals out of whatever the person is building. I would leave it be.

Onya
Woz

mack

3:23 am on Sep 30, 2002 (gmt 0)

WebmasterWorld Administrator 10+ Year Member Top Contributors Of The Month



I would agree with Woz.
A lot of users of Gossamer Threads Links 2 and the Hyperseek search engine use URL Spider PRO to gather specific data for building web directories. The spider (if standard) is supposedly server friendly. The user sends the bot to a selection of websites, and it extracts the pages that contain keywords the spider owner has requested. That is why it is able to build specific or "on theme" directories.
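To make the idea concrete, here is a rough Python sketch of that kind of keyword-driven selection, combined with a robots.txt check. This is not URL Spider's actual code, which I have not seen. The function names, the sample pages, and the example.com URLs are all invented for illustration, and the pages are supplied as plain text rather than fetched.

```python
from urllib.robotparser import RobotFileParser


def allowed_by_robots(robots_txt, agent, url):
    """Check a URL against robots.txt rules supplied as a string."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)


def matches_theme(page_text, keywords):
    """True if any requested keyword occurs in the page (case-insensitive)."""
    text = page_text.lower()
    return any(word.lower() in text for word in keywords)


def select_pages(pages, robots_txt, agent, keywords):
    """Keep only pages that robots.txt permits and that are on theme."""
    return [
        url
        for url, text in pages.items()
        if allowed_by_robots(robots_txt, agent, url) and matches_theme(text, keywords)
    ]


# Invented sample data; a real spider would fetch these over HTTP.
robots_txt = "User-agent: *\nDisallow: /private/\n"
pages = {
    "http://example.com/": "Vintage car parts and repair guides",
    "http://example.com/private/notes.html": "Car parts price list",
    "http://example.com/about.html": "Contact us",
}
selected = select_pages(pages, robots_txt, "Lickity_Split", ["car parts"])
print(selected)
```

Only the first page survives here: the second is on theme but disallowed by robots.txt, and the third is allowed but contains none of the keywords.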