Welcome to WebmasterWorld Guest from 54.144.80.75

Forum Moderators: goodroi

Message Too Old, No Replies

Blocking a spider from a directory

     
5:34 pm on Nov 3, 2003 (gmt 0)

Junior Member

10+ Year Member

joined:Jan 5, 2003
posts:63
votes: 0


If I want a specific spider to not visit a directory I would use

User-agent: spidername
Disallow: /directory name

but what if the directory was:

[domainname.com...]

Would it be:

User-agent: spidername
Disallow: /index.php/cPath

or

User-agent: spidername
Disallow: /index.php

or

User-agent: spidername
Disallow: /cPath

TIA,
Javi :)

2:52 am on Nov 5, 2003 (gmt 0)

Senior Member

WebmasterWorld Senior Member jdmorgan is a WebmasterWorld Top Contributor of All Time 10+ Year Member

joined:Mar 31, 2002
posts:25430
votes: 0


Javi,

Since robots use prefix-matching, I'd use "Disallow: /index.php/cPath"

Jim

4:30 am on Nov 5, 2003 (gmt 0)

Preferred Member

10+ Year Member

joined:Aug 20, 2003
posts:408
votes: 0


Just remember that sometimes the bots ignore the robots.txt and go in there anyhow. I have hundreds of dynamic links crawled even though I disallowed the directory.
 

Join The Conversation

Moderators and Top Contributors

Hot Threads This Week

Featured Threads

Free SEO Tools

Hire Expert Members