Forum Moderators: open
Personally I prefer to use case-insensitive regular expressions to identify spambots UA's - my current list looks like this (it's fed from what my site sees - there might be a downloader or unwelcome crawler in here but I've tried to take these out of this list);
e.*mail
extract
collector
^Mozilla/\d\.\d\s\(compatible;\sAdvanced\sEmail\sExtractor\sv\d\.\d+\)$
CherryPicker
Crescent
^Mozilla/\d\.\d\s\(compatible;\sMSIE\s\d\.\d;\sWindows\sNT;\sDigExt;\sDTS\sAgent$
e-collector
EmailCollector
^EmailSiphon$
EmailWolf
ExtractorPro
Microsoft\sURL\sControl
NEWT\sActiveX
Teleport
Telesoft
UtilMind\sHTTPGet
WebBandit
WebEMailExtrac
WinHttp\.WinHttpRequest\.\d+
Zeus\s*Webster
^Mozilla/3\.Mozilla/2\.01\s\(Win95;\sI\)$
^Internet\sExplore\s{0,1}\d{0,1}\.{0,1}[a-z0-9]+$
^Internet\sExplorer\s{0,1}\d{0,1}\.{0,1}\d{0,1}$
^IE\s\d\.\d\sCompatible.*Browser$
^Microsoft\sInternet\sExplorer/4\.40\.426\s\(Windows\s95\)$
^MSIE(\s\d\.\d¦)$
^Mozilla$
^Mozilla(\\¦/)\?\?$
^Production\sBot\s\d{4}B$
^Harvest
<note>The site turns solid vertical bar into broken vertical bar</note>
- Tony