Forum Moderators: open

Message Too Old, No Replies

Yet another email collector

Light Sense Inc HTTP Control

         

bobriggs

3:29 pm on Nov 12, 2001 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I couldn't find a previous post on it.

Characteristics:
UA: Light Sense Inc HTTP Control.

Attempted to fetch /contact/ and /contact (without trailing slash) Funny, didn't try for contact.html ;)

Didn't look at robots.txt, of course.

BTW, the product page on this site also lists a matching 'spider mailer' mass email program.

What's the best way to deal with these bots? I think it's almost impossible to keep up with all the UAs.

What about disallowing access (403) to the page if a referer string is not given? This is OK if you don't want it spidered by a valid SE bot, but what about browsers that can turn off their referer string? Any other ideas?

toolman

4:19 pm on Nov 12, 2001 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



The bulletproof way is to not have your email exposed.

Hard code your email into your form processor.