Rewrite in conf files

I am trying to create a general rewrite conf file in order to address common issues across several web sites (currently 3 but counting). The idea is to reduce the chance of missing an htaccess file when adding a new bot etc to the server.

Are there any gotchas in doing this? I'm particularly concerned about blocking baddies and letting in known bots. Snippets from the conf file are below (elipses used to denote more lines of code). The whole lot is in a single conf file in /etc/apache2 and loaded from apache2.conf as:
include rewrite.conf

It is enclused in:
<IfModule mod_rewrite.c>
</IfModule>

RewriteEngine On
...
RewriteCond %(HTTP_USER_AGENT) ^Mozilla/5\.0\s$compatible;\sYandexBot/3\.0;\s\+http://yandex\.com/bots$$ [OR]
RewriteCond %(HTTP_USER_AGENT) ^Mozilla/5\.0\s$compatible;\sbingbot/2\.0;\s\+http://www\.bing\.com/bingbot\.htm$$ [OR]
...
RewriteCond %(HTTP_USER_AGENT) ^Mozilla/5\.0\s$compatible;\sDuckDuckBot-Https/1\.1;\shttps://duckduckgo\.com/duckduckbot$$
RewriteRule .* - [L]

The first line, Yandex, does not work in that yandex has repeatedly hit one site for the past two or three days and always received a 403. Bing and google (not shown) receive 200. I'm guessing Yandex is stuck in a groove after being blocked before I set up the above; I've had to block its IP ranges in iptables for now.

# kill bad user-agents - this is only one of several rewrite UA blocks that follow
RewriteCond %{HTTP_USER_AGENT} ^$ [OR]
RewriteCond %{HTTP_USER_AGENT} ^[\"\'$%&*()-=+_@~#{}[]<>,.?/|\\\!] [OR]
RewriteCond %{HTTP_USER_AGENT} ^.*(<|>|'|%0A|%0D|%27|%3C|%3E|%00).* [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^ht[tm][lpr] [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^.*(HTTrack|clshttp|archive|loader|email|nikto|miner|python).* [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^.*(winhttp|libwww|perl|curl|wget|harvest|scan|grab|extract).* [NC]
RewriteRule . - [F,L]

The above (apparently) blocks python but not zgrab (should be trapped by grab).

# common bad user-agents
RewriteCond %{HTTP_USER_AGENT} (agent|analy[sz]|anonymous|bandit|bot|brand|cherrypicker|collector|compatible;[a-z]|craftbot|crawl|deepnet|discover|download|explorer|file|greasemonkey|indy\slibrary|java|larbin|le[ae]ch|legs|link|lynx|mail|netcraft|ninja|n[-_\s]?u[-_\s]?t[-_\s]?c[-_\s]?h|open|php|proxy|ripper|script|search|seo|shodan|sitemap|snoop|sph?ider|stripper|sucker|survey|sweep|torrent|webpictures|webspider|worm) [NC]
RewriteRule . - [F,L]

This should block anything-[Bb]ot (except bing etc above) but does not. It used to when it was in htaccess. And yes, I restart apache after changes.

Is there something I should look out for when dealing with conf files or is there something stupid in the above code?

Rewrite in conf files

Any special limitations?

dstiles

ClosedForLunch

whitespace

dstiles

lucy24

dstiles

lucy24

dstiles

ClosedForLunch

lucy24

dstiles

dstiles

lucy24

dstiles

dstiles

whitespace

lucy24

dstiles

lucy24

dstiles

lucy24

dstiles

dstiles

lucy24

dstiles

Join The Conversation

Moderators and Top Contributors

Hot Threads This Week