66.249.64.58 - - [17/May/2005:11:12:59 -0400] "GET /robots.txt HTTP/1.0" 403 - "-" "Googlebot/2.1 (+http://www.google.com/bot.html)"
66.249.64.58 - - [17/May/2005:11:12:59 -0400] "GET / HTTP/1.0" 304 - "-" "Googlebot/2.1 (+http://www.google.com/bot.html)"
66.249.64.79 - - [17/May/2005:11:13:11 -0400] "GET /robots.txt HTTP/1.0" 403 - "-" "Googlebot/2.1 (+http://www.google.com/bot.html)"
66.249.64.79 - - [17/May/2005:11:13:12 -0400] "GET / HTTP/1.0" 304 - "-" "Googlebot/2.1 (+http://www.google.com/bot.html)"
66.249.64.79 - - [17/May/2005:11:13:12 -0400] "GET / HTTP/1.0" 200 5971 "-" "Googlebot/2.1 (+http://www.google.com/bot.html)"
66.249.64.58 - - [17/May/2005:11:22:55 -0400] "GET /robots.txt HTTP/1.0" 403 952 "-" "Googlebot/2.1 (+http://www.google.com/bot.html)"
66.249.64.58 - - [17/May/2005:11:22:55 -0400] "GET / HTTP/1.0" 200 7035 "-" "Googlebot/2.1 (+http://www.google.com/bot.html)"
and so on..
I'm new to all of this, but I can see that Googlebot is asking for a robots.txt file, which is not on my server. First question: does that hurt me? Do I need to create one, even if it is just an empty file?
Second question: the part "GET / HTTP/1.0" means it is requesting the root directory "/", right? Why isn't it going deeper, with something like "GET /mysite.html HTTP/1.0"? Am I missing something here?
Also, my primary website is at subdomain.domain.com (that address has been listed in DMOZ for about a year). Can I somehow "encourage" Googlebot to crawl that page?
How can I check whether the If-Modified-Since header is supported on my server?
What do you think about these logs?
Thanks.
<Files ~ "^\.robots\.txt">
Order allow,deny
Deny from all
Satisfy All
</Files>
I set it up to prevent people from viewing my robots.txt file.
I've removed it already. Thanks for pointing that out.
Coming back to my question:
Is it normal for Googlebot to request just the root directory "/" and the robots.txt file?
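On the earlier question of whether an empty robots.txt is enough: the sketch below runs three candidate files through the parser in Python's standard library and asks whether a crawler identifying itself as "Googlebot" may fetch "/". The helper name `googlebot_allowed` and the three sample files are made up for illustration. This is Python's own parser, not Googlebot's, so treat the result as an approximation of how a well-behaved crawler reads the file.

```python
from urllib.robotparser import RobotFileParser


def googlebot_allowed(robots_txt: str, path: str = "/") -> bool:
    """Return True if the given robots.txt text lets "Googlebot" fetch path.

    Hypothetical helper for illustration; it only reflects the rules
    implemented by Python's urllib.robotparser.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch("Googlebot", path)


# Three illustrative robots.txt bodies (assumptions, not from the thread).
EMPTY = ""                                  # an empty file
ALLOW_ALL = "User-agent: *\nDisallow:\n"    # explicit "nothing is disallowed"
BLOCK_ALL = "User-agent: *\nDisallow: /\n"  # everything is disallowed

if __name__ == "__main__":
    for name, body in [("empty", EMPTY), ("allow all", ALLOW_ALL), ("block all", BLOCK_ALL)]:
        print(name, "->", googlebot_allowed(body))
```

Under this parser the empty file and the explicit allow-all file behave the same way, and only the `Disallow: /` file blocks the root. The file has to be readable by crawlers for any of this to apply.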
66.249.64.58 - - [17/May/2005:11:12:59 -0400] "GET / HTTP/1.0" 304 - "-" "Googlebot/2.1 (+http://www.google.com/bot.html)"
66.249.64.79 - - [17/May/2005:11:13:11 -0400] "GET /robots.txt HTTP/1.0" 403 - "-" "Googlebot/2.1 (+http://www.google.com/bot.html)"
66.249.64.79 - - [17/May/2005:11:13:12 -0400] "GET / HTTP/1.0" 304 - "-" "Googlebot/2.1 (+http://www.google.com/bot.html)"
66.249.64.79 - - [17/May/2005:11:13:12 -0400] "GET / HTTP/1.0" 200 5971 "-" "Googlebot/2.1 (+http://www.google.com/bot.html)"
66.249.64.58 - - [17/May/2005:11:22:55 -0400] "GET /robots.txt HTTP/1.0" 403 952 "-" "Googlebot/2.1 (+http://www.google.com/bot.html)"
66.249.64.58 - - [17/May/2005:11:22:55 -0400] "GET / HTTP/1.0" 200 7035 "-" "Googlebot/2.1 (+http://www.google.com/bot.html)"
These are fine.
Response codes:
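200: OK. The server sent a fresh copy of the page.
304: Not Modified. The page has not changed since Googlebot's last visit, so the server sent no body.
403: Forbidden. The server refused to serve the file.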
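To read a longer log the same way, a small script can pull the path and status code out of each line and tally them. The following is a minimal sketch that assumes the log is in Apache's "combined" format, as the lines in this thread appear to be. The names `LOG_PATTERN`, `parse_line`, `summarize`, and `SAMPLE` are made up for illustration, and the three sample lines are copied from the log above.

```python
import re
from collections import Counter

# Regex for one line of Apache "combined" log format (assumed format).
LOG_PATTERN = re.compile(
    r'(?P<ip>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<path>\S+) (?P<protocol>[^"]+)" '
    r'(?P<status>\d{3}) (?P<size>\S+) '
    r'"(?P<referer>[^"]*)" "(?P<agent>[^"]*)"'
)


def parse_line(line):
    """Parse one log line into a dict, or return None if it does not match."""
    match = LOG_PATTERN.match(line)
    if match is None:
        return None
    entry = match.groupdict()
    entry["status"] = int(entry["status"])
    # Apache writes "-" when no response body was sent (e.g. for a 304).
    entry["size"] = 0 if entry["size"] == "-" else int(entry["size"])
    return entry


def summarize(lines):
    """Count requests per (path, status) pair, skipping unparseable lines."""
    counts = Counter()
    for line in lines:
        entry = parse_line(line)
        if entry is not None:
            counts[(entry["path"], entry["status"])] += 1
    return counts


# Three lines taken from the log posted in this thread.
SAMPLE = [
    '66.249.64.79 - - [17/May/2005:11:13:11 -0400] "GET /robots.txt HTTP/1.0" 403 - "-" "Googlebot/2.1 (+http://www.google.com/bot.html)"',
    '66.249.64.79 - - [17/May/2005:11:13:12 -0400] "GET / HTTP/1.0" 304 - "-" "Googlebot/2.1 (+http://www.google.com/bot.html)"',
    '66.249.64.79 - - [17/May/2005:11:13:12 -0400] "GET / HTTP/1.0" 200 5971 "-" "Googlebot/2.1 (+http://www.google.com/bot.html)"',
]

if __name__ == "__main__":
    for (path, status), count in sorted(summarize(SAMPLE).items()):
        print(path, status, count)
```

A tally like this makes it easy to spot a robots.txt that keeps returning 403, or a home page that only ever returns 304.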