Forum Moderators: open
Wilderness, I've seen the unspecified one as well - but that hasn't bothered me much.
By any chance, has anyone seen a browser that returns a null string? I can't ban this in .htaccess because Apache always returns "-" for a null string and some browsers legitimately have a hyphen...
I suggest you read deeply in this forum before implementing blocking. You can block user-agents starting with a specified string, user-agents ending with a specified string, user-agents containing a specified string, or user-agents which exactly match a specified string.
That's what all the hats and dollars ( "^" and "$" ) are for in the block lists you see here. These are present because the string-matching uses regular expressions [etext.lib.virginia.edu] - a pattern-matching specificaton language, if you will.
You really don't want to block null user-agents! There are many reasons a user-agent might be null, and many are beyond the control of the person using the browser. Corporate proxies often block UA, and Norton Internet Security and similar products will also block UA in their out-of-the-box default mode.
Jim
Jim,
I ALMOST! Agree with you here 100 %.
I have seen many devious attempts to access with a null UA.
The only vaild spidering I've lost as a result of denying blank UA's was I believe Lycos.
I emailed the bot's website and they were totally baffled by explanation that the only reason their bot was being denied was because of the null UA.
The amazing thing was that each time they spidered there were two log entires. One with both null Referrer and UA. Then the second where the robots.txt was read and the UA was identified.
The logic of this defies me. Yet they wouldn't contine with the spider as a result of the 403 from the initial null.
By the same token, I don't want someone ripping my site without any possible identification as to who it might be (other than an IP, of course) - and since I read my logs twice a day (I can hear you all laughing LOL) by the time I've discovered a rogue bot, it's too late. But personal anaylsis often makes the difference when determining whether or not the visit was legitimate.
Spearmaster,
Ilike your thinking :-)
I've had a visitor today at four different time frames from a University. They enter the main page. No robot's or nothing.
These types of visits have taugt me two be suspicious.
I'm sure the computer labs at many university's run 24/7 however I'm not sure they are out making probes on a holiday?