Forum Moderators: phranque
Can anyone help me implement such a policy using mod_rewrite (or some other means).
Thanks,
Jeff Patterson
The basic problem, described here many times [google.com], is that referrer-based access control is unreliable because the Referer header is unreliable; Many corporate and ISP caching proxies block the Referer header, and many browsers and "Internet Security" software packages can be set by the user (or come set by default) to block the Referer header.
It is also folly to force all search-derived traffic to a landing page -- This is a recipe for losing visitors quickly. Especially when those visitors are 'distressed' and are likely not part of a tech-savvy Web demographic. It's important to back off from the problem at hand, and take a look at the big picture: The effects on search-engine derived traffic and site usability issues are both important. If you block rogue 'bots, but kill your site, what's the point?
A cookies-and-script -based approach (described in some of the previous threads) with exclusions to allow search engine robots to spider the site, is a better way forward.
Jim
So I don't want to redirect _all_ search engine originated visits. Just the ones that are trying to access the message file directory. I want to force all access to these files through my cgi script where I can detect and block scrappers.