Forum Moderators: DixonJones
66.249.65.163 - - [22/Dec/2004:08:46:59 -0800] "GET /robots.txt HTTP/1.1" 200 1727 "-" "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"
66.249.65.163 - - [22/Dec/2004:08:46:59 -0800] "GET /MYfileExtension.html?proceed=1&counter=1 HTTP/1.1" 200 12819 "-" "Mozilla/5.0
Since that request rendered a 200 when it perhaps should have sent a 404, I searched for ?proceed=1&counter=1 [google.com], but all you seem to see there are a bunch of hit counter (? ) results.
So, my question is this: Is this a special request sent out by the googlebot? Say, just looking for counters? If so, Why?
If not, then how come when other visitors add some jibberish at the end...
GET /MYfileExtension.html&s=svsxahafavfpHTTP/1.1"404 2847 "Removed by me" "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)
...it renders a 404 as it should. The file does not exist.
All files that do not exist should, in effect, be sent to my custom 404 page.
Yet, Google marches right thru that premise.
Anyone care to elaborate?
[beta.search.msn.com...]
Not so much a help, as a place to start investigating.
66.249.65.163 - - [22/Dec/2004:08:46:59 -0800] "GET /MYfileExtension.html?proceed=1&counter=1 HTTP/1.1" 200 12819 "-" "Mozilla/5.0
Your server sent 12819 bytes of something in response to that request... have you requested the URL yourself to see what was actually sent; or if the URL responds 404 for you; are there any other "normal" log entries with a response size of 12819?
Ex: /WhateverTheFileRequested?proceed=1&counter=1
I bring up the root url, paste 'the above' into the address bar, just beside the root url and click thru.
The page I go to is mine, but also includes the proceed_counter_thing at the end of the root URL.
Google say: ?proceed=1&counter=1 [google.com]
Also, no matter what interior page I open, it still replicates the same, so it's not restricted to the root URL.
Such to say, I can go to an interior page, paste the ?proceed=1&counter=1 onto the end and hit the go button taking me directly to that page.
All such requests were presented with all the content requested by their browser.
Aside from this little oddity, I wonder how / why Google landed on it?
Where does this ?proceed=1&counter=1 originate and what is it's purpose?