Forum Library, Charter, Moderators: goodroi

Sitemaps, Meta Data, and robots.txt Forum

    
Blocking a spider from a directory
JVB_Mktg




msg:1526919
 5:34 pm on Nov 3, 2003 (gmt 0)

If I want to keep a specific spider out of a directory, I would use:

User-agent: spidername
Disallow: /directoryname/

but what if the directory was:

[domainname.com...]

Would it be:

User-agent: spidername
Disallow: /index.php/cPath

or

User-agent: spidername
Disallow: /index.php

or

User-agent: spidername
Disallow: /cPath

TIA,
Javi :)

 

jdMorgan




msg:1526920
 2:52 am on Nov 5, 2003 (gmt 0)

Javi,

Since robots use prefix-matching (a Disallow value blocks every URL path that begins with it), I'd use "Disallow: /index.php/cPath"

Jim
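
For anyone who wants to check the prefix-matching behaviour locally, here is a rough sketch using Python's standard urllib.robotparser module. The example URLs (such as http://domainname.com/index.php/cPath/21) are made up for illustration, since the original URL in the question is truncated, and is_allowed is just a hypothetical helper name. Real spiders may interpret robots.txt differently from this parser, so treat it as a sanity check only.

```python
from urllib.robotparser import RobotFileParser

# The rule suggested above; "spidername" is the placeholder from the question.
RULES = """\
User-agent: spidername
Disallow: /index.php/cPath
"""


def is_allowed(robots_txt, agent, url):
    """Return True if `agent` may fetch `url` under the given robots.txt text.

    Hypothetical helper: it feeds the text to the standard-library parser,
    which compares each Disallow value against the start of the URL path.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)


if __name__ == "__main__":
    # Assumed example URLs, not taken from the original post.
    for url in (
        "http://domainname.com/index.php/cPath/21",  # path starts with the prefix
        "http://domainname.com/index.php",           # shorter than the prefix
        "http://domainname.com/cPath/21",            # different path altogether
    ):
        print(url, is_allowed(RULES, "spidername", url))
```

With this rule only paths that begin with /index.php/cPath are blocked for spidername. "Disallow: /index.php" would block everything served through index.php, and "Disallow: /cPath" would not match the URL in question at all, because the path starts with /index.php.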

PhraSEOlogy




msg:1526921
 4:30 am on Nov 5, 2003 (gmt 0)

Just remember that some bots ignore robots.txt and crawl the disallowed paths anyhow. I have had hundreds of dynamic links crawled even though I disallowed the directory.


All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved