Forum Moderators: goodroi

Message Too Old, No Replies

robots.txt syntax for wildcards?

         

Marfola

10:57 am on May 12, 2008 (gmt 0)

10+ Year Member



I would like to exclude all pages ending in /print.html and all pages with a ? in the url string in my robots.txt file. Is the following syntax correct for Yahoo, Google and MSN?

Disallow: /*print.html$
Disallow: /*?

Achernar

1:30 pm on May 12, 2008 (gmt 0)

10+ Year Member Top Contributors Of The Month



Note that MSNbot doesn't understand this type of wildcard. M$ is still stuck in DOS-time. The bot only understand wildcard when it is used for a file extension:
Disallow: /pagename.*$

Marfola

7:15 am on May 15, 2008 (gmt 0)

10+ Year Member



I’m pretty sure that’s not true. See [search.msn.com...]