Forum Moderators: coopster & phranque

Robot snooping

         

toolman

1:43 pm on Oct 19, 2001 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I just got robbed! A competing company just slurped through my entire site.

It was very polite at one request per second and didn't carry a spider "footprint". All I have is an IP address and a UA of Mozilla/5.0.

I'm already blocking UAs with mod_rewrite. The question is: how can I stop something like this? Is there a regular expression that would identify Mozilla/5.0 combined with more than 10 requests arriving at one per second?

Brett_Tabke

6:54 am on Oct 23, 2001 (gmt 0)

WebmasterWorld Administrator 10+ Year Member Top Contributors Of The Month



Offhand, toolman, the only way to do it is to track sessions. That can be fairly involved and resource-hungry.
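
To give a rough idea of what session tracking might involve, here is a minimal Python sketch that keeps recent request times per client in memory and flags a client once it exceeds a threshold. Everything in it is an assumption made for illustration: the `RequestTracker` name, keying on IP plus user agent, and the limit of 10 requests per 15-second window. It is not something mod_rewrite can do by itself, since a regular expression has no memory of earlier requests.

```python
import time
from collections import defaultdict, deque


class RequestTracker:
    """Hypothetical in-memory tracker: flags clients that exceed
    max_requests within a sliding window of window_seconds."""

    def __init__(self, max_requests=10, window_seconds=15.0):
        self.max_requests = max_requests
        self.window_seconds = window_seconds
        # (ip, user_agent) -> timestamps of requests still inside the window
        self._hits = defaultdict(deque)

    def should_block(self, ip, user_agent, now=None):
        """Record one request and return True if the client is over the limit."""
        if now is None:
            now = time.monotonic()
        hits = self._hits[(ip, user_agent)]
        hits.append(now)
        # Discard timestamps that have aged out of the window.
        while hits and now - hits[0] >= self.window_seconds:
            hits.popleft()
        return len(hits) > self.max_requests


if __name__ == "__main__":
    tracker = RequestTracker()
    # Simulate a crawler making one request per second from a single IP.
    for second in range(12):
        blocked = tracker.should_block("192.0.2.1", "Mozilla/5.0", now=float(second))
        print(second, "blocked" if blocked else "ok")
```

The per-client state is where the resource cost comes from: this sketch never evicts idle clients, so a real deployment would need to expire old entries and share the state across server processes.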

littleman

8:31 pm on Oct 23, 2001 (gmt 0)



Basically, there is no way to block someone who wants to get at your publicly available stuff if they know what they are doing.