Welcome to WebmasterWorld Guest from 54.162.214.65

Forum Moderators: goodroi

Message Too Old, No Replies

robots.txt syntax for wildcards?

     
10:57 am on May 12, 2008 (gmt 0)

Junior Member

10+ Year Member

joined:June 14, 2006
posts: 85
votes: 0


I would like to exclude all pages ending in /print.html and all pages with a ? in the url string in my robots.txt file. Is the following syntax correct for Yahoo, Google and MSN?

Disallow: /*print.html$
Disallow: /*?
1:30 pm on May 12, 2008 (gmt 0)

Full Member

5+ Year Member

joined:Dec 3, 2006
posts:257
votes: 0


Note that MSNbot doesn't understand this type of wildcard. M$ is still stuck in DOS-time. The bot only understand wildcard when it is used for a file extension:
Disallow: /pagename.*$
7:15 am on May 15, 2008 (gmt 0)

Junior Member

10+ Year Member

joined:June 14, 2006
posts: 85
votes: 0


Iím pretty sure thatís not true. See [search.msn.com...]