Forum Moderators: open
anyway to technically identify a bot and differenciate
Technically no however. . .
If your experinced in reading your logs (as to how visitors have travelled both good and bad in the past)and knowledgeable about the content of your web pages and the realtionship of your pages? All of this added together to analyze how the current visitor is going through your site than you make a logical assumption. "Known" bots provided in this forum and other places also provides an enhancement to your assumption.
Also the bot traps do a good job of separating these visitors as well.
You can also tell by watching the path... does it run cgi scripts that need user input, or only pages?
Finally, grep -c... see HOW many pages it looked at!
All of these (when taken together) should help you!
dave
In addition, slurp (Inktomi's bot...) will identify as mozilla as well, and FAST will sometimes spider with a generic (mozilla) user agent.
The only sure fire way I know is to strip the inbound headers when somebody does a request to your site...many agents will have extra headers that 'real' browsers won't have.
Also, most bots will not use session variables & cookies...if you do all that, go through the headers, and use sessions variables & cookies - check for image calls as well - then you should be able to sort most of them.