> All that cool stuff those guys are doing is neat but is just not worth the time and expense for me to figure out or hire somebody to do.
That is a wise approach.
> Could somebody come up with something us non techie people could use to just paste in our .htaccess files that just stops the most well known easy to spot bots by user agent?
The problem is not one of techie vs. non-techie. It's one of identifying "bad bots" and "the most egregious offenders" for your site(s). My list of the worst bad ones may be ineffective on your site, because yours attracts a different mix of bots.
The first step is to identify *your* problem bots, and then use the code snippets in that thread as examples to build your own code. Reviewing your stats should give you a good idea of which bots are the most trouble for you. Maybe a small derivative example will be more helpful than the huge lists in that thread:
Options +FollowSymLinks
RewriteEngine on
RewriteCond %{HTTP_USER_AGENT} Indy.Library [NC,OR]
RewriteCond %{HTTP_USER_AGENT} RPT.HTTPClient/ [OR]
RewriteCond %{HTTP_USER_AGENT} ^WebStripper/
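# Return 403 Forbidden to any user agent matched above.
RewriteRule .* - [F]
# Alternative: if you serve a custom 403 error page, use the rule below
# INSTEAD of the one above, so blocked agents can still fetch that page.
# Use only one of the two. The RewriteCond lines apply only to the first
# RewriteRule that follows them, so a second, uncommented rule here would
# block every visitor.
# RewriteRule !^path_to_403_error_page\.html$ - [F]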
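To adapt this to your own site, a minimal sketch might look like the following. ExampleBot and AnotherBot are made-up placeholder names, not real user agents; substitute the strings that show up in your own stats. Every RewriteCond except the last carries [OR], [NC] makes the match case-insensitive, and a leading ^ anchors the pattern to the start of the user-agent string.

```apache
RewriteEngine on
# Hypothetical names: replace them with user-agent strings from your own logs.
# Anchored with ^, so this matches only agents that START with "ExampleBot/".
RewriteCond %{HTTP_USER_AGENT} ^ExampleBot/ [NC,OR]
# Unanchored, so this matches "AnotherBot" anywhere in the string.
# The last condition must not carry [OR].
RewriteCond %{HTTP_USER_AGENT} AnotherBot [NC]
# Return 403 Forbidden when any condition above matched.
RewriteRule .* - [F]
```

If you add more conditions later, insert them above the last one and give each [OR]. An [OR] left on the final condition, or one missing in the middle, changes which requests get blocked.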
Apache mod_rewrite documentation [httpd.apache.org]
Apache URL Rewriting Guide [httpd.apache.org]
Regular Expressions Tutorial [etext.lib.virginia.edu]
Jim