Forum Moderators: open
google
inktomi
AV/FAST
wisenut
teoma
openfind
What's the safest and most efficient way to identify their spiders when you search for a substring of $HTTP_SERVER_VARS['HTTP_USER_AGENT'] in php?
"googlebot" and "slurp@inktomi.com" will obviously work well for the first two. What about the other four?
"teomaagent"
"crawler@fast.no"
"WISEnutbot.com"
"robot-response@openfind.com"
Would that work? My concerns are that maybe one of the strings isn't up-to-date or is bound to change in due time. Would ALL the bots of the respective search engines contain these strings? Also, I wouldn't want to use anything that could also be part of a regular UA string, hence leaving a "normal" user without a proper session.
Thanks!
===
The best way is indeed to check there IP adress ...
i got a tool for macintosh computer .. i am sure there
is something simular for PC ...
i type in the Ip adress of a robot ...
see sample: 64.68.82.46
my log says its from google. but to be 100% sure
i run it in this program it tells me this:
ip is from: crawler11.googlebot.com.
now i know for sure its google