Forum Moderators: goodroi
User-agent: *
Disallow:
Thanks for any advice.
These errors may also skew the results of your 'stats' program, if you use one.
favicon.ico, w3c/p3p.xml, and labels.rdf are three more standard resources you might consider providing.
The code you posted looks fine. Put a blank line after the "Disallow:" line for maximum compatibility (Every "record" in a robots.txt file should be followed by a blank line, and there was one (European?) 'bot a few years ago that insisted on its presence, even for the last record).
There have also been unconfirmed reports that having a robots.txt file increases the number of pages spidered by MSNbot on your site. So far, not enough data has been collected for me to conclude that this is true.
Jim
Thanks for your advice. Yes, there have been a few favicon.ico errors too. Don’t know why that is because favicon.ico should be linked on all my pages and resides in the root directory. Maybe I'm missing one or two. I'll check.
One other question if you will permit. A large number of errors (over 100 each in last two months) are requests for "mysite.com/index.htm/" and "mysite.com/defaultsite". Surely an error 404 page is served in these instances because the pages don't exist. But should I direct the bots not to look for them?