homepage Welcome to WebmasterWorld Guest from 54.163.72.86
register, free tools, login, search, pro membership, help, library, announcements, recent posts, open posts,
Become a Pro Member

Home / Forums Index / Search Engines / Sitemaps, Meta Data, and robots.txt
Forum Library, Charter, Moderators: goodroi

Sitemaps, Meta Data, and robots.txt Forum

    
disallow: /*
robot.txt newbie, please help
emm0uk




msg:1527977
 11:11 pm on Mar 29, 2004 (gmt 0)

i'm new to robots and i would like a simple (if possible) way to prevent robots from indexing all of my site content apart from the contents of the root folder. I have sub directories that i do not want indexing, but would rather not put the name of the folders on a robots.txt, so i was wondering if i can use a wildcard.
eg:
user-agent: *
disallow:/ *
Would this allow my root folder to be indexed, but prevent sub-folder from being indexed?
Also, how can i hide the robots.txt from all apart from robots? Script examples would be much appreciated as i'm pretty new to web building. Thanks all.

 

moltar




msg:1527978
 11:15 pm on Mar 29, 2004 (gmt 0)

Welcome to WW emm0uk!

That would not do the trick. You need to specify each folder individually to avoid spidering.

Why would you want to hide them from robots anyways? If one can find it through your site then there is no need to hide it. If nobody can find it, then spiders won't find it either.

check out Robots.txt Tutorial [searchengineworld.com].

Global Options:
 top home search open messages active posts  
 

Home / Forums Index / Search Engines / Sitemaps, Meta Data, and robots.txt
rss feed

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
Home ¦ Free Tools ¦ Terms of Service ¦ Privacy Policy ¦ Report Problem ¦ About ¦ Library ¦ Newsletter
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved