Welcome to WebmasterWorld Guest from 54.197.72.5

Forum Moderators: Ocean10000 & incrediBILL & keyplyr

Message Too Old, No Replies

207.230.106.188 DIIbot/1.1, www.findsame.com, robot@digital-

     
8:56 pm on Aug 3, 2000 (gmt 0)

Senior Member

WebmasterWorld Senior Member littleman is a WebmasterWorld Top Contributor of All Time 10+ Year Member

joined:June 17, 2000
posts:2924
votes: 0


These guys have been snooping for a while. It looks like they are now using there snooping bot to also build an SE. It is an interesting concept. Looks like they are raiding inktomi for urls.
www.findsame.com [findsame.com]
digital-integrity.com [digital-integrity.com]
9:13 pm on Aug 3, 2000 (gmt 0)

Administrator from US 

WebmasterWorld Administrator brett_tabke is a WebmasterWorld Top Contributor of All Time 10+ Year Member Top Contributors Of The Month

joined:Sept 21, 1999
posts:38066
votes: 15


Is it crawler behavior though? Just just single page pulls?
9:22 pm on Aug 3, 2000 (gmt 0)

Senior Member

WebmasterWorld Senior Member littleman is a WebmasterWorld Top Contributor of All Time 10+ Year Member

joined:June 17, 2000
posts:2924
votes: 0


Yeah, it looks that way. It seems to be *slowly* following links. It also has been pulling the robot.txt for every <added>root</added> request.

2:53 am on Aug 4, 2000 (gmt 0)

Senior Member

WebmasterWorld Senior Member 10+ Year Member

joined:July 8, 2000
posts:684
votes: 0


Brett/Littleman,
Noticed increased action from them also.. Have either of you had stepped up crawling from Matahari recently? They used to just hit and miss us, but over the last few days, have been hitting huge numbers of URL's....
5:26 am on Aug 4, 2000 (gmt 0)

Junior Member

10+ Year Member

joined:July 28, 2000
posts:134
votes: 0


Bandwidth waste like this outfits earn a
deny from ip_range
entry in my access.conf files
12:17 pm on Aug 4, 2000 (gmt 0)

Administrator from US 

WebmasterWorld Administrator brett_tabke is a WebmasterWorld Top Contributor of All Time 10+ Year Member Top Contributors Of The Month

joined:Sept 21, 1999
posts:38066
votes: 15


I was thinking the findsame was just random stuff.
I can only find a few hits from digital integrity.

Pete, access.conf, nice work if you can get it - the rest of us are stuck with .haccess banning (slow).