Forum Moderators: Robert Charlton & goodroi
however take a look at this.
[root@some-server ~]# grep Googlebot /var/log/#*$!xx/access.log ¦ grep robots ¦ tail
66.249.71.233 - - [13/Nov/2008:20:54:43 -0600] GET /robots.txt HTTP/1.1 "200" 269 "-" "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)" "-"
66.249.71.233 - - [13/Nov/2008:20:54:47 -0600] GET /robots.txt HTTP/1.1 "200" 269 "-" "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)" "-"
66.249.71.233 - - [13/Nov/2008:20:55:01 -0600] GET /robots.txt HTTP/1.1 "200" 269 "-" "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)" "-"
66.249.71.233 - - [13/Nov/2008:20:55:08 -0600] GET /robots.txt HTTP/1.1 "200" 269 "-" "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)" "-"
66.249.71.233 - - [13/Nov/2008:20:55:13 -0600] GET /robots.txt HTTP/1.1 "200" 269 "-" "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)" "-"
66.249.71.233 - - [13/Nov/2008:20:55:18 -0600] GET /robots.txt HTTP/1.1 "200" 269 "-" "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)" "-"
66.249.71.233 - - [13/Nov/2008:20:55:18 -0600] GET /robots.txt HTTP/1.1 "200" 269 "-" "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)" "-"
66.249.71.233 - - [13/Nov/2008:20:55:20 -0600] GET /robots.txt HTTP/1.1 "200" 269 "-" "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)" "-"
66.249.71.233 - - [13/Nov/2008:20:55:21 -0600] GET /robots.txt HTTP/1.1 "200" 269 "-" "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)" "-"
66.249.71.233 - - [13/Nov/2008:20:55:24 -0600] GET /robots.txt HTTP/1.1 "200" 269 "-" "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)" "-"
[root@some-server ~]#
I havent made any changes to robots.txt for ages, the file is normally fetchable, and doesnt throw any error....
Is this something I should be worried about?
What could possible make googlebot keep on refetching the robots again and again.?
Use the Live HTTP Headers add-on for Firefox/Mozilla, or one of the many on-line headers checkers.
kylee's post reminded me of when I last saw this same thing (several years ago), and the reason was due to a server configuration error; My server response headers were telling Google not to cache the robots.txt file for more than a few minutes -- and they complied... :o
Jim
[edited by: jdMorgan at 2:46 am (utc) on Nov. 15, 2008]
the http headers
me@my-laptop:~$ HEAD http://sub.example.com/robots.txt
200 OK
Connection: close
Date: Sat, 15 Nov 2008 04:33:39 GMT
Accept-Ranges: bytes
ETag: "57e00e-118-4549545197840"
Server: nginx/0.6.32
Content-Length: 280
Content-Type: text/plain; charset=UTF-8
Last-Modified: Sat, 16 Aug 2008 15:19:53 GMT
Client-Date: Sat, 15 Nov 2008 04:33:39 GMT
Client-Peer: xx.xx.xx.xx:80
Client-Response-Num: 1
Ill try notifying via webmaster central. Thanks for the advice
[edited by: tedster at 6:34 am (utc) on Nov. 15, 2008]
[edit reason] de-link the url [/edit]