Forum Moderators: Robert Charlton & goodroi
I have now attributed this to occurring for my domain, when the main page of my site (and domain) apparently gets spidered by Googlebot twice in succession, within 1 second of each other. I garnered this from my webserver logs, and have now gone back in history to see that this is a problem.
The first spidering returns a 304 (cached version) by my webserver, whereas the second spidering (1 second later) results in a full fetch (webserver return code of 200).
Does anybody have any thoughts on:
1) Why the double-spidering by Google in the span of 1 second?
2) Why this may have weird effects on my rankings almost immediately
upon the double-spidering?
Thanks in advance, and Happy Fourth for those celebrating.
To remedy, I wind up changing content on this page, re-submitting my sitemap to Google, and once this page is re-indexed (singularly), after a day or 2, my rankings are back to where they once were.
I am extremely perplexed by this sequence of events, and for the life of am trying to explain this double-spidering, which leads to this problem.
This makes me think that the second request you see from googlebot may have a weird "If-Modified-Since" header with an ancient or otherwise buggy time stamp (not the time of the request, but the time that the if-modified-since refers to). If your server is operating properly, that should be the only time it returns a 200 status and gives the full html document in response.
Can you capture the "If-Modified-Since" header that googlebot is sending on the second request? That may give you a big clue.
You can report googlebot crawling issues from this page [google.com] -- follow the instructiions on validating that it really is your site, attaching a logfile, etc.
PS - I don't know if the googlebot team would care about the apparent tie in to ranking changes or not. Their main area of focus would be the doubled request and any technical error in the header it includes.
Now as to why they'd do that, and whether and why this ties in with a -950 penalty, I've no idea. But is is entirely possible to see a 304-Not Modified response followed by a 200-OK, since it is the client that determines whether to include If-Modified-Since in its request.
Jim
In regard to the googlebot link that you provided, after reading it carefully, I'm a little loathe to be submitting my site to this page. It appears that it should be used ONLY if you are uncomfortable with googlebot's crawl rate to your server. The last thing I would want to do is for Google to process my request, and "downgrade" or eliminate altogether the crawl rate to my site.