I admin my own VPS. I've tweaked Apache to block many IP ranges, mostly blocking traffic from global regions that generate traffic bent on doing more harm than good. (Login hacks, injection attacks, etc.)
I've also blocked a number of bots that rip my sites . . err . . harvest data . . for the benefit of the bot operator's clients (rarely for my benefit). These are bots used to study links, evaluate ranking factors, evaluate content for competitive analysis, copyright infringement, etc. (I feel like putting out a sign "I'll give you access to my data/content IF you give me access to your data.")
I understand there are bots used to gather data for agencies that use/sell access to that data, intending to assist clients in targeting sites as "advertising opportunities", that is, placing ads on those sites. I'm not clear which bots are the good-guys-gals and what their IP ranges are. I don't wish to block the bots (potentially) doing me some direct good.
Please list any bots / bot IP ranges that you know are used, specifically, to assist advertisers in choosing which sites to target with their AdWords Content Network advertising dollars/inventory.
Which bots are beneficial to those deploying AdSense? Which bots (besides G's bots) may help publishers make money via Adsense targeting?