homepage Welcome to WebmasterWorld Guest from 54.163.72.86
register, free tools, login, search, pro membership, help, library, announcements, recent posts, open posts,
Become a Pro Member

Home / Forums Index / Search Engines / Sitemaps, Meta Data, and robots.txt
Forum Library, Charter, Moderators: goodroi

Sitemaps, Meta Data, and robots.txt Forum

    
robots.txt syntax for wildcards?
Marfola




msg:3647778
 10:57 am on May 12, 2008 (gmt 0)

I would like to exclude all pages ending in /print.html and all pages with a ? in the url string in my robots.txt file. Is the following syntax correct for Yahoo, Google and MSN?

Disallow: /*print.html$
Disallow: /*?

 

Achernar




msg:3647853
 1:30 pm on May 12, 2008 (gmt 0)

Note that MSNbot doesn't understand this type of wildcard. M$ is still stuck in DOS-time. The bot only understand wildcard when it is used for a file extension:
Disallow: /pagename.*$

Marfola




msg:3650590
 7:15 am on May 15, 2008 (gmt 0)

Iím pretty sure thatís not true. See [search.msn.com...]

Global Options:
 top home search open messages active posts  
 

Home / Forums Index / Search Engines / Sitemaps, Meta Data, and robots.txt
rss feed

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
Home ¦ Free Tools ¦ Terms of Service ¦ Privacy Policy ¦ Report Problem ¦ About ¦ Library ¦ Newsletter
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved