|Is a Googlebot a Googlebot a Googlebot?|
Or does each have its own "personality"?
I was looking over my recent logs and I noticed that I have about 2 dozen visits from Googlebots (assuming that they are the real thing, and not pretenders), all of which came from these 3 IP ranges:
So now I'm wondering -- does each Googlebot behave exactly the same in the way that it goes through a website? Or could each of those 3 IP zones have slightly different characteristics? For example, one puts more emphasis on alt tags; the next puts more emphasis on words in bold; the third gives more weight to image names; etc.
If they do not behave exactly the same then it seems that each one would have an impact which could vary -- for better or worse -- on how the SERPs are determined.
It is probably difficult to gauge this sort of relationship (Date > Googlebot > IP range > SERPs), but as I said, I'm curious whether it plays any role at all in the shifts we see.
You can verify if a googlebot user agent is really from Google - see How to verify Googlebot is Googlebot [webmasterworld.com].
A second factoid in this mix: all of the Google spiders now share a crawl cache, which is then used by the algorithm to score the SERPs. So the spider and the algorithm are two separate steps. The spider just retrieves the pages and stores them in Google's shared crawl cache, and then the algorithm ranks them.
You can see some "interesting" patterns of fetch if you look at your logs over a period of time.
Thanks tedster -- I figured that the bot was only retrieving data, but wanted to confirm from someone with expertise in the field.
g1smd -- am not sure what you mean by "interesting patterns of fetch" -- are you referring to the way the bot moves through a site? That level of analysis is something I've not done yet, so to be honest I may not fully grasp what the pattern would be telling me.
Several things: how they request URLs across a site, how often stuff is fetched, how some stuff is fetched more often than others, and so on.
The pattern reveals little or nothing about how things work, but sometimes you can attribute a change of pattern to something that you did to the site content, or internal navigation, or linking pattern.
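For anyone who wants to look at those fetch patterns without reading raw logs by eye, here is a rough sketch that counts requests per day and URL for anything claiming to be Googlebot. It assumes the Apache/Nginx "combined" log format, and the function name and sample lines are made up for illustration, so adjust the regex to whatever your server actually writes.

```python
import re
from collections import Counter

# Assumption: Apache/Nginx "combined" log format. Other formats need a different regex.
LOG_LINE = re.compile(
    r'^(?P<ip>\S+) \S+ \S+ \[(?P<day>[^:]+):[^\]]+\] '
    r'"(?:GET|HEAD|POST) (?P<path>\S+) [^"]*" \d{3} \S+ '
    r'"[^"]*" "(?P<agent>[^"]*)"'
)

def googlebot_fetch_counts(lines):
    """Count requests per (day, path) whose user agent claims to be Googlebot."""
    counts = Counter()
    for line in lines:
        match = LOG_LINE.match(line)
        # The user agent is only a claim; it is not verified here.
        if match and "googlebot" in match.group("agent").lower():
            counts[(match.group("day"), match.group("path"))] += 1
    return counts

# Hypothetical sample lines, not taken from any real log.
UA = "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"
sample = [
    f'66.249.65.1 - - [04/May/2007:10:00:00 +0000] "GET /index.html HTTP/1.1" 200 512 "-" "{UA}"',
    f'66.249.66.2 - - [04/May/2007:18:30:00 +0000] "GET /index.html HTTP/1.1" 200 512 "-" "{UA}"',
    f'66.249.72.3 - - [05/May/2007:02:15:00 +0000] "GET /about.html HTTP/1.1" 200 300 "-" "{UA}"',
    '10.0.0.5 - - [05/May/2007:02:16:00 +0000] "GET /about.html HTTP/1.1" 200 300 "-" "Mozilla/5.0"',
]

counts = googlebot_fetch_counts(sample)
for (day, path), n in sorted(counts.items()):
    print(day, path, n)
```

Running this over a few weeks of logs and comparing the counts before and after a site change is one way to spot the kind of shift described above.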
I helped out on another forum that was under attack about six months back. The attacking bot was called something like "GOOGLEBOTRUSSIA". Of course I realised immediately that it didn't belong to Google and blocked it to recover the site, but there are spam bots masquerading as Google. It's well worth checking.
|but there are spam bots masquerading as google. It's well worth checking.|
Tedster told me in another thread that since Google can/does change its IP, it is not dependable to keep a "white" list. But to date -- for me at least -- all the googlebots I see are in the range 66.249.65.xx, 66.249.66.xx, and 66.249.72.xx. So if I see anything calling itself googlebot from outside 66.249, I will be suspicious.
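As a rough illustration of that "be suspicious" check, the sketch below flags any address outside the three ranges mentioned in this post. Treating each "xx" range as a /24 block is my assumption, and the list reflects only what one site's logs showed, not an official or complete set of Google addresses.

```python
import ipaddress

# Assumption: each "66.249.NN.xx" range from the post is treated as a /24 block.
# These are ranges observed on one site, not an authoritative whitelist.
OBSERVED_RANGES = [
    ipaddress.ip_network(block)
    for block in ("66.249.65.0/24", "66.249.66.0/24", "66.249.72.0/24")
]

def in_observed_ranges(ip):
    """Return True if ip falls inside one of the ranges seen so far."""
    address = ipaddress.ip_address(ip)
    return any(address in network for network in OBSERVED_RANGES)

for candidate in ("66.249.66.17", "66.249.70.1", "203.0.113.9"):
    label = "seen before" if in_observed_ranges(candidate) else "worth a closer look"
    print(candidate, label)
```

A miss here only means the address is new to you; it does not prove the visitor is a fake.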
If you can employ the method described in this link, you don't need to be suspicious - you can just plain-old know for sure. It also works for slurp, by the way.
How to verify Googlebot is Googlebot [webmasterworld.com].
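The linked thread is not reproduced here, so take this as a sketch of the commonly described approach rather than a copy of it: do a reverse DNS lookup on the visiting IP, check that the hostname is under a Google domain, then do a forward lookup on that hostname and confirm it points back to the same IP. The domain suffixes and the function name are my assumptions; the lookups themselves use Python's standard `socket` module.

```python
import socket

# Assumption: genuine Googlebot hosts reverse-resolve to names under these domains.
GOOGLE_SUFFIXES = (".googlebot.com", ".google.com")

def is_verified_googlebot(ip,
                          reverse_lookup=socket.gethostbyaddr,
                          forward_lookup=socket.gethostbyname_ex):
    """Reverse-then-forward DNS check for an IP claiming to be Googlebot.

    The lookup functions are parameters so the check can be tried offline
    with stand-in resolvers.
    """
    try:
        hostname = reverse_lookup(ip)[0]          # step 1: IP -> hostname
    except OSError:
        return False
    if not hostname.lower().rstrip(".").endswith(GOOGLE_SUFFIXES):
        return False                              # step 2: hostname must be Google's
    try:
        addresses = forward_lookup(hostname)[2]   # step 3: hostname -> list of IPs
    except OSError:
        return False
    return ip in addresses                        # step 4: must round-trip to same IP

# Offline demonstration with stand-in resolvers (hypothetical hostnames).
def fake_reverse(ip):
    return ("crawl-66-249-66-1.googlebot.com", [], [ip])

def fake_forward(hostname):
    return (hostname, [], ["66.249.66.1"])

print(is_verified_googlebot("66.249.66.1", fake_reverse, fake_forward))
```

In real use you would call `is_verified_googlebot(ip)` with the defaults so that it performs live DNS lookups. The forward lookup matters because anyone who controls the reverse DNS for their own IP block can make it return a Google-looking hostname.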
[edited by: tedster at 11:40 pm (utc) on May 4, 2007]
Google changes IP addresses from time to time, but the range of IP addresses that is available to them is well known and in the public domain. They have several very large blocks and a number of smaller allocations.