207.230.106.188 DIIbot/1.1, www.findsame.com, robot@digital-

         

littleman

8:56 pm on Aug 3, 2000 (gmt 0)



These guys have been snooping for a while. It looks like they are now using their snooping bot to build an SE as well. It is an interesting concept. It looks like they are raiding Inktomi for URLs.
www.findsame.com
digital-integrity.com

Brett_Tabke

9:13 pm on Aug 3, 2000 (gmt 0)

WebmasterWorld Administrator 10+ Year Member Top Contributors Of The Month



Is it crawler behavior, though? Or just single page pulls?

littleman

9:22 pm on Aug 3, 2000 (gmt 0)



Yeah, it looks that way. It seems to be *slowly* following links. It has also been pulling robots.txt for every root request.

redzone

2:53 am on Aug 4, 2000 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Brett/Littleman,
I've noticed increased activity from them too. Have either of you seen stepped-up crawling from Matahari recently? They used to hit us only sporadically, but over the last few days they have been hitting huge numbers of URLs.

PeteU

5:26 am on Aug 4, 2000 (gmt 0)

10+ Year Member



Bandwidth wasters like this outfit earn a
deny from ip_range
entry in my access.conf files.
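
For anyone who wants to see roughly what such an entry looks like, here is a minimal sketch in Apache 1.3-style mod_access syntax. The directory path is a placeholder, and the single address is the one from this thread's title. The bot's actual range is not given here, so the commented-out range forms are purely illustrative assumptions.

```apache
# Server-wide config (access.conf or httpd.conf); read once at startup.
# /usr/local/apache/htdocs is a placeholder document root (assumption).
<Directory "/usr/local/apache/htdocs">
    # Evaluate Allow first, then Deny; a request matching both is denied.
    Order allow,deny
    Allow from all

    # Single address taken from the thread title.
    Deny from 207.230.106.188

    # Hypothetical range forms (the real range is an assumption):
    # Deny from 207.230.106
    # Deny from 207.230.106.0/24
</Directory>
```

The same Order/Allow/Deny lines can go in a .htaccess file without the <Directory> wrapper, provided AllowOverride permits Limit for that directory. The server rereads .htaccess files on every request, which is why per-directory banning costs more than a rule in the main config. On Apache 2.4 and later these directives are deprecated in favor of Require.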

Brett_Tabke

12:17 pm on Aug 4, 2000 (gmt 0)

WebmasterWorld Administrator 10+ Year Member Top Contributors Of The Month



I was thinking the findsame hits were just random stuff.
I can only find a few hits from digital integrity.

Pete, access.conf, nice work if you can get it - the rest of us are stuck with .htaccess banning (slow).