Forum Moderators: Robert Charlton & goodroi
Thanks!
Here's another oddity in what you're seeing. According to the official Google Blog post about Verifying Googlebot [googlewebmastercentral.blogspot.com] "the recommended technique would be to do a reverse DNS lookup, verify that the name is in the googlebot.com domain, and then do a corresponding forward DNS->IP lookup using that googlebot.com name."
But this IP address doesn't seem to have reverse DNS set up at all. So I'd say just forget about it, except that your image indexing is affected, so something's up here. Still, if you stop filtering that Class B range, I suppose there's a chance that you would be more vulnerable to IP spoofing in this range from malicious bots. But then again, I guess malicious bots are even more likely to spoof a Googlebot user agent anyway.
If Google Image Search traffic is useful for your business, I guess it's worth the experiment to open up the filter.
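For anyone who wants to script the check quoted from the Google post (reverse DNS, confirm the name is under googlebot.com, then forward DNS back to the same IP), here is a minimal Python sketch. The function name `is_verified_googlebot` and the injectable `reverse_lookup` / `forward_lookup` parameters are my own assumptions for illustration, not anything Google publishes; by default it uses the standard library's `socket.gethostbyaddr` and `socket.gethostbyname_ex`.

```python
import socket

# Domain named in the quoted Google post; the leading dot stops
# look-alikes such as "fakegooglebot.com" from matching.
GOOGLEBOT_SUFFIX = ".googlebot.com"


def is_verified_googlebot(ip, reverse_lookup=None, forward_lookup=None):
    """Return True only if ip passes the reverse-then-forward DNS check.

    reverse_lookup and forward_lookup are hypothetical hooks so the
    logic can be exercised without touching real DNS.
    """
    if reverse_lookup is None:
        reverse_lookup = lambda addr: socket.gethostbyaddr(addr)[0]
    if forward_lookup is None:
        forward_lookup = lambda name: socket.gethostbyname_ex(name)[2]

    # Step 1: reverse DNS. An IP with no PTR record fails here.
    try:
        host = reverse_lookup(ip)
    except OSError:
        return False

    # Step 2: the returned name must sit inside googlebot.com.
    host = host.rstrip(".").lower()
    if not host.endswith(GOOGLEBOT_SUFFIX):
        return False

    # Step 3: forward DNS on that name must lead back to the same IP.
    try:
        addresses = forward_lookup(host)
    except OSError:
        return False
    return ip in addresses
```

An IP with no reverse DNS at all, like the one discussed here, simply comes back `False` at step 1, which matches the "can't be verified this way" situation rather than proving anything about who owns it.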
[webmasterworld.com...]
and
[webmasterworld.com...]
Still, I am having trouble getting images in the index. The big problem is it takes soooooooo looooooong before you see any results when trying to optimize for this. About a year ago, I changed all the image file names from pic1, pic2, pic3 to small-red-widget, big-green-widget, medium-blue-widget, etc., along with appropriate alt attributes. All the images disappeared and I am still waiting for any fruits to bloom off of the labor tree ;-)
Oh well...
We ended up doing an IP whois before blocking any scraper, just to make sure it wasn't one of these Google IPs. I'm even curious whether Google uses IPs that it doesn't "own" as well, because every time we put our scraper scripts into effect we see a SERP drop very soon after, and when we take them off the rankings go back up. I was wondering if Google has a bot pretending to be human that we end up blocking, and whether that causes the drop. We have gone through the log files to make sure we only block bots, and I whois each one; they all look like scrapers in the traditional sense.
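The "check before you block" step can also be done locally once you have collected the ranges from whois. A rough Python sketch using the standard `ipaddress` module follows; the `PROTECTED_RANGES` list and the `safe_to_block` name are hypothetical, and the CIDR blocks shown are RFC 5737 documentation placeholders, not real Google ranges, so substitute whatever your own whois lookups turn up.

```python
import ipaddress

# Placeholder ranges only (RFC 5737 documentation blocks).
# Replace with the ranges your whois lookups attribute to Google.
PROTECTED_RANGES = [
    ipaddress.ip_network("192.0.2.0/24"),
    ipaddress.ip_network("198.51.100.0/24"),
]


def safe_to_block(ip, protected=PROTECTED_RANGES):
    """Return False if ip falls inside any protected range."""
    addr = ipaddress.ip_address(ip)
    return not any(addr in network for network in protected)
```

A scraper script could call `safe_to_block(ip)` before adding a rule and skip (or log for manual review) anything that comes back `False`.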
I just assumed this was the Google image bot
I don't think so, this is something different.
I just checked and I'm seeing this IP but I didn't see it ask for robots.txt.
However, I did see this user agent also associated with the IP:
"Mozilla/5.0 (X11; U; Linux i686 (x86_64); en-US; rv:1.8.1.14) Gecko/20080404 Firefox/2.0.0.14"
I could be wrong, but so far every time I've seen a combination of image requests and Firefox on Linux, it's typically been someone making screenshots. However, I suspect Google would be making a lot more screenshots if that's what they're really doing, unless they're just experimenting: images and HTML combined from this IP came to only 32 files.
FWIW, don't worry about blocking things from Google's IP ranges in general because they have other services that have been used by scrapers which I block all the time without repercussion.
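If you want to repeat the log check described above for one IP (did it ask for robots.txt, which user agents did it send, how many files did it pull), something like this Python sketch would do it. It assumes the Apache/NCSA "combined" log format, which may not match your server, and `summarize_ip` is just an illustrative name.

```python
import re

# Assumes combined log format:
# ip ident user [time] "METHOD path PROTO" status bytes "referer" "agent"
LOG_LINE = re.compile(
    r'^(\S+) \S+ \S+ \[[^\]]*\] "\S+ (\S+)[^"]*" \d{3} \S+ "[^"]*" "([^"]*)"'
)


def summarize_ip(log_lines, ip):
    """Count requests from ip and note robots.txt fetches and user agents."""
    summary = {"requests": 0, "fetched_robots": False, "user_agents": set()}
    for line in log_lines:
        match = LOG_LINE.match(line)
        if not match or match.group(1) != ip:
            continue  # unparseable line or a different client
        path, agent = match.group(2), match.group(3)
        summary["requests"] += 1
        summary["user_agents"].add(agent)
        # Ignore any query string when checking for robots.txt.
        if path.split("?", 1)[0] == "/robots.txt":
            summary["fetched_robots"] = True
    return summary
```

Feed it an open log file (for example `summarize_ip(open("access.log"), "203.0.113.7")`, with both the file name and the IP being placeholders) and compare the result against what a well-behaved crawler would look like.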