Forum Moderators: open
Java/1.4.1_04
Java/1.5.0_03
Java/1.4.2_03
Java/1.4.2
Am I safe in banning these as follows?
SetEnvIfNoCase User-Agent "^Java" bad_bot
<Limit GET POST>
Order Allow,Deny
Allow from all
Deny from env=bad_bot
</Limit>
Or do I run the risk of banning 'legitimate' java tools?
To be honest, as long as Google, Yahoo and MSN aren't likely to be affected, I'll take the risk of upsetting that .1% legitimate visitors. ;-)
If you ever want to use the Google "Remove URL" tool, you need to be aware it uses a Java UA. If you are certain you will never use that tool you are OK, otherwise you might need to whitelist the relevant Google IPs.
I manually review bots before banning them. And this time these bots must have ripped through most of my site in the two days it took me to notice them. I don't check the logs every day and this bunch seem to have systematically trawled every single directory and link they could find. Although they didn't fall for any of my robots.txt honeypot directories, so perhaps they respect the robots rule.
Ho hum. At least they won't be back hopefully.
RewriteCond %{HTTP_USER_AGENT} Java [NC,OR]
Seems to take care of any/all instances... I've used this line for years with no ill-effects... however you should use your own discretion when banning bots.