|Blocking a spider from a directory|
| 5:34 pm on Nov 3, 2003 (gmt 0)|
If I want to keep a specific spider out of a directory, I would use
Disallow: /directory name
but what if the directory was:
Would it be:
| 2:52 am on Nov 5, 2003 (gmt 0)|
Since robots use prefix-matching, I'd use "Disallow: /index.php/cPath"
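To see the prefix matching in action, here is a minimal sketch using Python's standard urllib.robotparser. The bot name "ExampleBot", the host example.com, and the sample path /index.php/cPath/21_30 are made up for illustration; only the Disallow line comes from the suggestion above. Python's parser is just one implementation, so a real crawler may behave differently.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: the bot name "ExampleBot" is an assumption;
# the Disallow line is the one suggested in the post above.
ROBOTS_TXT = """\
User-agent: ExampleBot
Disallow: /index.php/cPath
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` on the hypothetical site."""
    return parser.can_fetch(agent, "http://example.com" + path)


# Any path that starts with the disallowed prefix is blocked for ExampleBot.
print(is_allowed("ExampleBot", "/index.php/cPath/21_30"))  # False
# A path that does not start with the prefix is still allowed.
print(is_allowed("ExampleBot", "/index.php"))              # True
# Bots without a matching User-agent record are not affected.
print(is_allowed("OtherBot", "/index.php/cPath/21_30"))    # True
```

Because the rule is matched as a prefix, there is no need to list every URL under /index.php/cPath separately.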
| 4:30 am on Nov 5, 2003 (gmt 0)|
Just remember that some bots ignore robots.txt and crawl the directory anyway. I have had hundreds of dynamic links crawled even though I disallowed the directory.
© Webmaster World 1996-2013 all rights reserved