Forum Moderators: Robert Charlton & goodroi

Google Spidering / Indexing Questions


doughayman

11:28 am on May 14, 2008 (gmt 0)

10+ Year Member



To all,

Sorry if these are dumb or repetitive questions, but I was unable to answer them to my satisfaction by looking at old posts.

Please don't ask why (there are reasons), but I am running an "ancient" webserver - O'Reilly & Associates WebSite V1.1h (copyright 1997, and no longer supported). It speaks HTTP/1.0 only (not 1.1). It appears, from my weblogs, that certain files on my server get spidered at least once a week, even though they have not changed. And when this occurs, my rankings tank for various keywords for this page, for a day or 2, until the "new" page gets indexed properly, at which time my rankings seem to return. This is repeatable, and I'm seeing it over and over again.

My questions are as follows:

1) When Googlebot visits my site, and accesses a file on my server, and the return status is "200", does this imply that Googlebot thinks that this file has changed, since its last fetch?

OR

Does this imply only that Google has fetched it, and that Google will subsequently determine, on its end, whether the file has changed since its last fetch?

2) I have read about Last-Modified and Expires headers on WebmasterWorld, but given that my webserver only speaks HTTP/1.0, it does not support these features. Aside from moving to a different webserver, can anyone suggest a mechanism I could employ that would keep Google (or any other search engine) from retrieving files that have not changed since the last Googlebot fetch? Unfortunately, there are no webserver configuration options that support this.

3) One other possible anomaly - I have a WebSite V1.1h logging feature enabled that generates extended log records, which is useful. However, I noticed that this log format timestamps in GMT, whereas my server machine's file timestamps are in local Eastern time. Could this 4-hour time differential between GMT and my local time possibly be causing Google to think that a file has changed when in fact it has not? I do have other logging options that timestamp webserver accesses in local time, which would put things in sync with my local server time.

I apologize in advance if these sound like ancient or stupid questions, but I'm just trying to pick the board's brains!

Thanks in advance !

Doug

[edited by: engine at 11:59 am (utc) on May 14, 2008]
[edit reason] formatting [/edit]

tedster

7:52 pm on May 14, 2008 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



1) The 200 status means only that your server found the requested resource and served it to googlebot. It implies nothing about the freshness of the file.

2) I'd say just make sure that your server puts an accurate "last modified" stamp on your files.

3) It's certainly worth the experiment to set your timestamp to GMT.

As you well know, the problem here is a server-specific technical problem and not really a Google question - and we don't have a forum here for your particular legacy server. Even if the server is no longer supported, I assume you have access to some documentation - and that's probably your best resource.

From this HTTP 1.0 documentation at the W3C [w3.org], the 304 response should have been available to servers since 1996, even under HTTP 1.0. However, that doesn't mean your server actually implements it.
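
If you want to test that directly, a minimal sketch along these lines might help - the URL is a placeholder, and Python is assumed only as a convenient client to run from any machine that can reach the server:

# Minimal sketch: check whether a server honours conditional GETs with a
# 304 Not Modified response. The URL below is a placeholder.
import urllib.error
import urllib.request

URL = "http://www.example.com/Page.htm"  # hypothetical page

# First request: note the Last-Modified header, if the server sends one.
with urllib.request.urlopen(URL) as resp:
    last_modified = resp.headers.get("Last-Modified")
    print("Status:", resp.status, "| Last-Modified:", last_modified)

# Second request: ask for the page only if it changed since that date.
if last_modified:
    req = urllib.request.Request(URL, headers={"If-Modified-Since": last_modified})
    try:
        with urllib.request.urlopen(req) as resp:
            print("Server re-sent the full page, status:", resp.status)
    except urllib.error.HTTPError as err:
        if err.code == 304:
            print("Server answered 304 Not Modified - conditional GETs work.")
        else:
            raise
else:
    print("No Last-Modified header - crawlers have nothing to condition on.")

If the second request comes back 304, the server already supports conditional requests and a crawler sending If-Modified-Since can skip unchanged files; if it re-sends the full page, the server simply ignores the condition.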

Receptional Andy

8:00 pm on May 14, 2008 (gmt 0)



certain files on my server get spidered at least once a week, even though they have not changed

Google has to spider your file to see if it has changed, since HTTP headers revealing modification dates are not widely implemented. It looks like Google expects your file to change more frequently than it actually does.
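
As a quick way to see what a crawler actually has to work with, a short sketch like this lists the cache-related headers the server sends - the URL is a placeholder, and Python is just one convenient way to check (any header-inspection tool would do):

# Sketch: list the cache-related response headers a server sends,
# so you can see what a crawler has to work with. URL is a placeholder.
import urllib.request

URL = "http://www.example.com/Page.htm"  # hypothetical page

req = urllib.request.Request(URL, method="HEAD")
with urllib.request.urlopen(req) as resp:
    for name in ("Last-Modified", "Expires", "ETag", "Cache-Control"):
        print(name + ":", resp.headers.get(name, "(not sent)"))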

certain files on my server get spidered at least once a week, even though they have not changed. And when this occurs, my rankings tank for various keywords for this page

An interesting effect, but can you clarify how you are measuring this? It's clear that Google wants updated results as frequently as possible, but to downgrade a page solely because it hasn't been updated would have to be considered a bug.

Do the keywords you target suggest a topic that would be more relevant if the results were newer?

doughayman

11:55 pm on May 14, 2008 (gmt 0)

10+ Year Member



Thanks for the responses, Tedster and Andy.

Andy, my traffic to a certain page (let's call it my money page, on the affected site) drastically decreases for 1-2 days, and the drop coincides with that file being spidered by Google. Establishing this has been a laborious effort of measuring traffic to my site and mapping it to weblog activity.

No, the keywords I target don't suggest a topic that would be more relevant, if results were fresher. That is not an issue here.

Receptional Andy

12:04 am on May 15, 2008 (gmt 0)



a laborious effort of measuring traffic to my site, in concert with mapping it to weblog activity

I don't mean to add to your labour, but what kind of sample size is involved? Spidering and ranking are related, but most often there is no cause and effect relationship.

doughayman

12:32 am on May 15, 2008 (gmt 0)

10+ Year Member



Andy, I eyeballed this for a period of several months, and then did a formal analysis over the same period (2-3 months). I correlated the declines in traffic directly to the 1-2 days after my "money page" was spidered. Without exception, my hypothesis was (unfortunately) confirmed. It is still occurring as we speak.

Tedster, as an experiment I have changed the log files to use my local server time, although I am not holding out much hope for this.

Receptional Andy

12:40 am on May 15, 2008 (gmt 0)



How willing are you to experiment with this page? Or is it an instant result you're after?

Your server software is clearly a major restriction, but there are a few things you might try, depending on the balance between spidering and performance. Personally, I think there's a lot to learn from fringe cases like this.

doughayman

12:54 am on May 15, 2008 (gmt 0)

10+ Year Member



LOL, I am always willing to experiment, although radical changes may be outside my limits. I make my living predominantly from my affiliate marketing business, and I can never put myself at total risk. If you have any suggestions that you would like to share, Andy, I am all ears.

Thanks,

Doug

Receptional Andy

1:00 am on May 15, 2008 (gmt 0)



To throw it back to you a little bit, what theory would you like to test? For instance, if you think there is a causal relationship between spidering and ranking, you have direct influence over spidering behaviour. That's an easy one to look at statistically.
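
For what it's worth, a rough sketch of that kind of check against the raw access log might look like the following - the log file name, log format and page path are all assumptions, so adjust them to whatever your server actually writes:

# Sketch: line up Googlebot fetches of the money page with daily traffic to it,
# to see whether drops really follow spider visits. Log format, file name and
# page path are assumptions - adjust to the real logs.
import re
from collections import Counter
from datetime import datetime, timedelta

ACCESS_LOG = "access.log"   # hypothetical log file in common/combined format
MONEY_PAGE = "/Page.htm"    # hypothetical page

googlebot_days = set()
daily_hits = Counter()

with open(ACCESS_LOG) as fh:
    for line in fh:
        m = re.search(r"\[(\d{2}/\w{3}/\d{4})", line)
        if not m or MONEY_PAGE not in line:
            continue
        day = datetime.strptime(m.group(1), "%d/%b/%Y").date()
        daily_hits[day] += 1
        if "Googlebot" in line:
            googlebot_days.add(day)

# Compare average traffic on the 1-2 days after a Googlebot fetch with all other days.
after_spider, other_days = [], []
for day, hits in sorted(daily_hits.items()):
    recently_spidered = any((day - timedelta(days=d)) in googlebot_days for d in (1, 2))
    (after_spider if recently_spidered else other_days).append(hits)

if after_spider and other_days:
    print("Avg hits 1-2 days after a Googlebot fetch:", sum(after_spider) / len(after_spider))
    print("Avg hits on all other days:               ", sum(other_days) / len(other_days))

If the "after a Googlebot fetch" average is consistently lower than the other days across a few months of data, that's at least a measurable pattern to work from rather than an impression.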

doughayman

1:07 am on May 15, 2008 (gmt 0)

10+ Year Member



Are you talking about an outright block (perhaps a robots.txt Disallow)? I am afraid of the potential longer-term repercussions of doing that.

doughayman

1:19 am on May 23, 2008 (gmt 0)

10+ Year Member



Going back to my original problem (1st post of this thread), what are the ramifications of my doing a:

Disallow: /

OR

Disallow: /Page.htm

as a mechanism for preventing Google (or any other spider) from spidering my site (or that particular page)?

Can I use this strategy to shut down the indexing of my site, and in turn open it back up when I actually make changes that I want spidered?

What are the ramifications? Will disallowing the spidering of my site (or designated files) cause any of the following to occur:

a) Reduced interest from Google in my site, and less frequent automatic spidering visits?

b) In the case where I am preventing the spidering of a file, will Google consider de-indexing that file?

c) Will Google consider my site (or the excluded pages) to be of less importance, which may result in decreased rankings for a given keyword phrase in relation to this site (or the excluded pages)?

As stated earlier, this problem ALWAYS occurs whenever these pages get spidered by Googlebot.

doughayman

9:00 pm on May 26, 2008 (gmt 0)

10+ Year Member



All I know is that I've gone from brown hair to all grey in the last 2 months... there's got to be a better way!

tedster

9:43 pm on May 26, 2008 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Doug, have you scrutinized Google's Help pages about robots.txt [google.com]? I'd look there first for the "official" word.

Can I use this strategy to shut down the indexing of my site, and in turn, open it back up, when I actually make changes that I want spidered ?

Yes, but I'd say it's not a good idea to go back and forth a lot. Better to develop changes in a test environment and then make them live when you are satisfied with what you've got.

A robots.txt disallow rule can result in those disallowed URLs being dropped from the Google index, or they may be shown as "URL-only". If googlebot can't spider a page, then that page may still be ranked, but only according to backlink influences. The content of that page can no longer be scored for ranking purposes.
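
If you do experiment with a disallow rule, one way to sanity-check exactly what it blocks for Googlebot before relying on it is the standard-library robots.txt parser - the site URL and paths below are placeholders:

# Sketch: check what a given robots.txt actually blocks for Googlebot.
# The site URL and paths are placeholders.
import urllib.robotparser

rp = urllib.robotparser.RobotFileParser()
rp.set_url("http://www.example.com/robots.txt")  # hypothetical site
rp.read()

for path in ("/", "/Page.htm", "/some-other-page.htm"):
    allowed = rp.can_fetch("Googlebot", "http://www.example.com" + path)
    print("Googlebot may fetch", path + ":", allowed)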

my rankings tank for various keywords for this page, for a day or 2, until the "new" page gets indexed properly, at which time my rankings seem to return.

A robots.txt disallow is likely to cause a much longer drop than one or two days - probably a drop for as long as spidering is disallowed.

I still can't easily see how spidering a page would always result in a short-term rankings drop - but Google can be a very complex critter. It may not be direct cause and effect, but just a related phenomenon. Did you change the timestamp to GMT?

A suggestion - you might want to study the SERPs that are there when your traffic tanks. Look for content copied from your site, for one thing. Also get VERY familiar with urls that do have stable rankings, their backlinking, content, coding - all of it. You never know what might jump out. Another idea - have you checked your page for any major html problems? How about checking any feedback from Webmaster Tools?

doughayman

11:01 pm on May 26, 2008 (gmt 0)

10+ Year Member



Ted,

As always, thank you for your input, wisdom, and patience.

Yes, I am intimate with the robots.txt doc. I concur that it might not be the best idea to use robots.txt to regulate my spidering via a "faucet" approach. I'm afraid of the long-term repercussions of using robots.txt to disable access to my site, as you suggest.

Yes, I changed my weblog timestamps - they are now posted in local Eastern time, which matches my server's system clock and file timestamps (formerly I was logging in GMT). This change seems to have had no effect at all.

Very little, if any, content has been copied from my site, although I'm not sure Copyscape gives you the full result set of copied pages any more. I think they now give a teaser, in the hope that you will subscribe to the full service. Even if I find copied segments of my site elsewhere, my past experience is that reporting these pirates rarely achieves the desired result.

Looking at other sites that rank well is probably a good idea, and maybe something will click as a result.

My site is pretty clean (and always has been) - changes have been minimal, and are always content changes in nature. I don't think major HTML problems are the issue (but who knows?). Also, I'm very clean in the eyes of Webmaster Tools. The only thing it "complains" about is that a couple of my internal pages have short meta description tags. No big deal there, and ironically, the pages it complains about in that regard are the internal pages that have PR - the ones it doesn't complain about have "grey-bar" PR, which seems to be symptomatic of many members' sites these days, as I've read on other threads.

Regards,

Doug