I want to block Google, Yahoo, MSN and ia_archiver from spidering the forum. How does this look for syntax?
User-agent: *
Disallow: /forum/
User-agent: Slurp
User-agent: Googlebot
User-agent: msnbot
User-agent: Mediapartners-Google
User-agent: Adsbot-Google
User-agent: ia_archiver-web.archive.org
Disallow:

User-agent: *
Disallow: /
User-agent: Slurp
User-agent: Googlebot
User-agent: msnbot
User-agent: Mediapartners-Google
User-agent: Adsbot-Google
User-agent: ia_archiver-web.archive.org
Disallow: /forum/

User-agent: *
Disallow: /
You can always verify how Googlebot will interpret a robots.txt file using the robots.txt analysis tool in Google Webmaster Tools. Just add the site you're interested in to your account, paste the test file into the tool, and check specific URLs to see whether the test file would block or allow them.
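For a quick local sanity check before you upload anything, something like the sketch below may help. It uses Python's standard-library urllib.robotparser, which is only an approximation: it is not Google's parser, and real crawlers may read the same file differently, so treat the results as a rough guide. The is_allowed helper, the www.example.com URLs and the "SomeOtherBot" name are made up for illustration, and the test file is a shortened version of the multi-agent record discussed in this thread.

```python
from urllib.robotparser import RobotFileParser

# Shortened test file: the named bots are kept out of /forum/ only,
# every other robot is kept out of the whole site.
ROBOTS_TXT = """\
User-agent: Slurp
User-agent: Googlebot
User-agent: msnbot
Disallow: /forum/

User-agent: *
Disallow: /
"""


def is_allowed(robots_txt, user_agent, url):
    """Return True if this parser thinks user_agent may fetch url.

    Hypothetical helper: it reflects the stdlib parser's reading of the
    file, which is not guaranteed to match any particular crawler.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    urls = (
        "http://www.example.com/",
        "http://www.example.com/forum/index.php",
    )
    for agent in ("Googlebot", "msnbot", "SomeOtherBot"):
        for url in urls:
            verdict = "allowed" if is_allowed(ROBOTS_TXT, agent, url) else "blocked"
            print(f"{agent:13} {url:40} {verdict}")
```

Swap in your own file contents and URLs to see how each record is applied. Keep the blank line between records and none inside a record, since this parser treats a blank line as the end of a record.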
There are plenty of badly-coded 'bots out there that are not really malicious, just incompetent...
Jim