I would like to exclude all pages ending in /print.html and all pages with a ? in the url string in my robots.txt file. Is the following syntax correct for Yahoo, Google and MSN?
Disallow: /*print.html$ Disallow: /*?
Achernar
1:30 pm on May 12, 2008 (gmt 0)
Note that MSNbot doesn't understand this type of wildcard. M$ is still stuck in DOS-time. The bot only understand wildcard when it is used for a file extension: Disallow: /pagename.*$
Marfola
7:15 am on May 15, 2008 (gmt 0)
I’m pretty sure that’s not true. See [search.msn.com...]