Forum Moderators: goodroi
So my other thought was to disallow every page of the site, but then allow individual instructional/info pages.
Something along the lines of:
User-agent: *
Disallow: /
User-agent: *
Allow: /info/step1
Allow: /info/step2
Allow: /info/step3
Is this feasible? Will this confuse the crawlers? Any help would be greatly appreciated.
-swipe
[edited by: SwipeTheMagnets at 5:57 pm (utc) on July 1, 2009]
Whenever you make changes to your robots.txt file it is wise to monitor crawler behavior for a few days to make sure everything is fine. One typo can potentially cause nightmares.
you should use the robots.txt checker [google.com] available in your Google Webmaster Tools dashboard.
if i recall correctly, your robots.txt file will exclude everything for all robots since it uses the first match in the order specified.
you might want to try something like this:
User-agent: *
Allow: /info/step1
Allow: /info/step2
Allow: /info/step3
Disallow: /