Forum Moderators: open
when requesting my brand new pages it sends
HTTP_IF_MODIFIED_SINCE=Mon, 10 Mar 2003 10:00:00 GMT
But Googlebot is not telling the truth! 10th of March my new domain wasn't even registered!
Does anyone else have any exprecience of this?
Leif
header("Status: 304 Not Modified")
Therefore it's really important that the bot is telling the truth, or else it will not get my pages. If it turns out that the bot sending the header wrongly, I would have to check by myself if the bot is telling the truth or not. that would be a very time consuming task.
I think that everyone should expect that if a bot is sending a HTTP_IF_MODIFIED_SINCE date, then it already have a copy of the webpage. That's the whole point!
Leif
I look at the HTTP_IF_MODIFIED_SINCE, if the bot (any bot) is sending me one, then I'm returning header("Status: 304 Not Modified")
That is wrong, you should not just look wether it is sent or not, you should compare the date it sends after the tag to the date of the last time your content/page changed.
Of course the bot may also be doing a mistake, maybe intentionally (They always send the "time" of the last major crawl or sth. like that, simply for performance reasons) or maybe in connection with the grand change that is taking place right now.
I guess we learn another one of life's standard lessons: If two parties rely on unproved assumptions when they meet, they might both be disappointed... ;)
If-modified-since is a little bit of trouble to set up for dynamic sites (not so hard for static sites), but it's definitely worth it. If Googlebot can see that it doesn't need to crawl an older page, that frees up resources to get a new page.