homepage Welcome to WebmasterWorld Guest from
register, free tools, login, search, pro membership, help, library, announcements, recent posts, open posts,
Become a Pro Member

Home / Forums Index / Search Engines / Sitemaps, Meta Data, and robots.txt
Forum Library, Charter, Moderators: goodroi

Sitemaps, Meta Data, and robots.txt Forum

Something a little different

10+ Year Member

Msg#: 3277736 posted 6:30 pm on Mar 10, 2007 (gmt 0)

Is there any way to restrict acces to subdirectory of a main directory if the main directory will be dynamic. For example;

I have a mod_rewrite which produces URLs like this;

sadly it also supplies the same content under;

(where n i sthe id of the category the item was posted)

can i stop bots spidering dynamic-item-title/catid/?

I'm sure there's probably a mod_rewrite i could do to solve this too, but thats something i cant figure out lol



WebmasterWorld Administrator goodroi us a WebmasterWorld Top Contributor of All Time 10+ Year Member Top Contributors Of The Month

Msg#: 3277736 posted 1:56 pm on Mar 12, 2007 (gmt 0)

you could use wildcards aka pattern matching. this is not part of the official robots.txt protocol but it is supported by most of the big spiders. it enables you to block any url that contains a certain string in it.

Google Pattern Matching Instructions [google.com]
Yahoo Wildcards Instructions [ysearchblog.com]
MSN Instructions [search.msn.com]

please note this is not supported by all spiders

Global Options:
 top home search open messages active posts  

Home / Forums Index / Search Engines / Sitemaps, Meta Data, and robots.txt
rss feed

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
Home ¦ Free Tools ¦ Terms of Service ¦ Privacy Policy ¦ Report Problem ¦ About ¦ Library ¦ Newsletter
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved