I just ran a test query against a sample of 200,000 requests I had ready in a database.
SELECT Left(uagent,10) AS uagent
FROM ex060704
GROUP BY Left(uagent,10);
Result: 54 records.
Mozilla
msnbot/
Opera/9
ColdFus
Yahoo-M
Opera/8
Those 6 account for about 99.5% of all requests.
Just as a starting point: analyse a couple of million requests this way and create a list of the starting strings of length 10 (or whatever you like).
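If the logs are not in a database, the same tally can be sketched in a few lines of Python. This is only an illustration: the function name `prefix_counts` and the sample user-agent strings are made up, and the prefix length is a parameter you would tune yourself.

```python
from collections import Counter

def prefix_counts(user_agents, length=10):
    """Count how often each leading substring of `length` characters occurs."""
    return Counter(ua[:length] for ua in user_agents)

# Made-up sample; real input would be the user-agent column of your logs.
sample = [
    "Mozilla/5.0 (Windows; U; Windows NT 5.1)",
    "Mozilla/4.0 (compatible; MSIE 6.0)",
    "msnbot/1.0 (+http://search.msn.com/msnbot.htm)",
    "xQz81kLp",
]

# Print prefixes from most to least frequent, ready for hand editing.
for prefix, n in prefix_counts(sample, length=7).most_common():
    print(prefix, n)
```

The rare prefixes at the bottom of that output are the candidates for weeding out.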
Hand-edit this list (you will probably weed out another half of them) and white-list the rest. This way you'll probably get rid of the random user-agent strings.
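The check against the finished list could look roughly like this in Python. The names `ALLOWED_PREFIXES` and `is_allowed` are invented for the example, and the list simply reuses the six strings from my query; your own hand-edited list will differ.

```python
# Hypothetical hand-edited white list of allowed starting strings.
ALLOWED_PREFIXES = (
    "Mozilla", "msnbot/", "Opera/9", "ColdFus", "Yahoo-M", "Opera/8",
)

def is_allowed(user_agent):
    """Return True if the user-agent starts with a white-listed prefix."""
    # str.startswith accepts a tuple and matches any of its entries.
    return user_agent.startswith(ALLOWED_PREFIXES)

print(is_allowed("Mozilla/5.0 (Windows; U; Windows NT 5.1)"))
print(is_allowed("xQz81kLp"))
```

Anything that merely claims to be Mozilla still passes, so this only filters the random strings.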
Combine this with a honey pot (a link that no user will normally click on) and wave them good-bye.
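A bare-bones sketch of the honey pot logic in Python, just to show the idea. The path `/trap/`, the in-memory `banned_ips` set and the function `handle_request` are all assumptions for illustration; a real setup would persist the bans and hook into whatever server you run.

```python
HONEYPOT_PATH = "/trap/"  # hypothetical URL behind the hidden link
banned_ips = set()        # in-memory only; a real site would persist this

def handle_request(ip, path):
    """Return an HTTP status: 403 for banned clients, 200 otherwise."""
    # Anyone who follows the hidden link gets banned from then on.
    if path.startswith(HONEYPOT_PATH):
        banned_ips.add(ip)
    if ip in banned_ips:
        return 403
    return 200
```

A client that requests `/trap/` once receives 403 for that request and for every later one, while other addresses are unaffected.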
I'd check every once in a while whether new browsers or search engines are knocking.
2cts ...
nerd.
The white list will mostly work, but once you are on the defensive like that, you will need serious ongoing maintenance to keep it up to date. That maintenance will get forgotten, and over time you will lose genuine visitors who are inadvertently banned for using a new beta browser or a new platform.
Dixon.