Welcome to WebmasterWorld Guest from 22.214.171.124
Forum Moderators: goodroi
In the mean time you can take a look at WebmasterWorld's own
Robots Checker [searchengineworld.com]
It validates a robots.txt file and has some nice info on robots-txt.
This can be a full path, or a partial path; any URL that starts with this value will not be retrieved. For example, Disallow: /help disallows both /help.html and /help/index.html, whereas Disallow: /help/ would disallow /help/index.html but allow /help.html
That would only get:
But would also get:
The problem is, that I don't believe all spiders follow the spec that way. They tend to do sliding regexs.