Welcome to WebmasterWorld Guest from 54.144.79.200

Forum Moderators: goodroi

Message Too Old, No Replies

Blocking a spider from a directory

     

JVB_Mktg

5:34 pm on Nov 3, 2003 (gmt 0)

10+ Year Member



If I want a specific spider to not visit a directory I would use

User-agent: spidername
Disallow: /directory name

but what if the directory was:

[domainname.com...]

Would it be:

User-agent: spidername
Disallow: /index.php/cPath

or

User-agent: spidername
Disallow: /index.php

or

User-agent: spidername
Disallow: /cPath

TIA,
Javi :)

jdMorgan

2:52 am on Nov 5, 2003 (gmt 0)

WebmasterWorld Senior Member jdmorgan is a WebmasterWorld Top Contributor of All Time 10+ Year Member



Javi,

Since robots use prefix-matching, I'd use "Disallow: /index.php/cPath"

Jim

PhraSEOlogy

4:30 am on Nov 5, 2003 (gmt 0)

10+ Year Member



Just remember that sometimes the bots ignore the robots.txt and go in there anyhow. I have hundreds of dynamic links crawled even though I disallowed the directory.