Forum Moderators: open
I'm blocking it but not sure whether it's useful or not. Nothing to do with anything I've submitted to their webmaster tools.
It's likely that any url would go to a page that gave a generic "this is our bot, we obey robots.txt" page - which I think bonecho doesn't anyway since it's purporting to be a browser (may even BE a browser!).
To be frank, Yahoo really are getting annoying, especially for such a small visitor return. Last month the top three crawlers hit across all the sites on my server as follows:
bonecho 16,145
slurp 113,136
googlebot 28,418
msnbot 10,268
So, a total from yahoo (at least!) of 129,281 compared with 38,686 for BOTH the others - that's 70% yahoo to 30% for the other two. And now they are dropping Verify on us as well (two years? Haven't seen it before or it would be in my Bots list!).
It's not as if there's 113,000 pages on the server to begin with! I doubt it's a tenth of that - they are all smallish sites.
Last month I blocked 41545 hits from undesriable IPs plus 29789 SQL Injection hits. Yahoo exceeded those baddies by a LONG way.
Webmasters trade spider bandwidth for traffic. I agree that Yahoo goes overboard, but it's all ROI -- Each Webmaster must make the call based on how much quality traffic he/she gets from Yahoo.
Jim
Yes, the headers are wrong but it is possible to fake that with a minor firefox script. I'm very inclined to believe it is a robot, though.
Part of the worry with yahoo (and increasingly with google and msn) is: what happens if you ban their "weird" bots? Do they still give a listing based on slurp or is this modified by results from the others? And if so how?
My figures above omitted msn's referer spam in the 65.55.*.* range by the way - add another 5,000 or so hits for that, plus their msr bot and various others. Still nowhere near Yahoo's hits, though.