Forum Moderators: Robert Charlton & goodroi
I'm new to Webmaster World, and I'm not sure if anyone has every posted about this, but I discovered an interesting piece of info about Google Bot this week. One of my newer accounts did not have a robots file on the server, and for some reason when the robots file was requested the server was returning a '500' response instead of '404'. I found that even though all of the other files on the site worked properly, Google will not crawl a website if the robots file returns a '500' response code, because it is not sure if the file actually exists. In layman terms - this means that Google will not crawl a website unless it is sure it is allowed to.
I also noticed that Yahoo and MSN will index a site even if the robots file returns '500'.
Anyways, if you have a site that isn't being indexed by Google and you cannot figure out why, try checking the response code that is served when your robots.txt file is requested.
Hope this helps everyone,
Paul