Forum Moderators: open
I get 1572 hits/mo from 65.214.39.180 (registered to ask.com) and 1025/mo from 193.95.154.69 (their UK arm), most of which are for my robots.txt; all the other requests from them have Minefield in the requesting agent field, which seems to be a Mozilla project. Are those user views or bot requests?
Here's what I think is correct so far:
For each GET *.html entry in my access log, search for bots first (case insensitive):
Google: googlebot 200/mo
MSN: msnbot 1500
Yahoo: slurp 3000
Ask: minefield 500
Once these entries are removed, search for a user hit from the engine:
Google: .google. 10,000
MSN: msn.co OR search.live 300
Yahoo: search.yahoo 1000
So, I seem to be getting a good (48:1) benefit from Googlebot, but less than breakeven from MSN or Yahoo. I haven't figured out how to identify ask.com user views.
Has someone looked into this that I haven't located? If not, any suggestions, particularly of bot-directed users I haven't identified?
Many folks today are using browsers with the refer turned off, or, they may even copy and paste the link (direct access absent of referral), and even proxy or content filters make all this difficult.
"Historically" Jeeves has never provided enough visitors to most websites in comparison to justify the constant crawling their bot (s) do.
Add to that, more Jeeves tools crawling from various ranges simultaneously and the rate of visitors is even lower.
<snip>
Two months after robots.txt'ing out Yahoo and MSN, I have an interim result of a 14% drop in user views vs. the 3% expected. Metasearchers aren't significant. I'm looking for other sources of traffic dependent on these two bots, and will appreciate any suggestions beyond the refer-off/proxy browsers that have already been mentioned here.
[edited by: volatilegx at 3:54 pm (utc) on Dec. 18, 2007]
[edit reason] no urls please [/edit]