Forum Moderators: goodroi
Board Does Not Exist.
Make sure you did not mis-type the URL.
How can I add a robots.txt file to my board? If that isn't possible, is there another way? I basically want to keep web crawlers out and stop cached copies of my pages from appearing anywhere.
Thank you!
But... I got the idea while looking at another InvisionFree forum. First I got this:
Robots.txt Query Exclusion.
We're sorry, access to [google.com...] has been blocked by the site owner via robots.txt.
Read more about robots.txt
See the site's robots.txt file.
Try another request or click here to search for all pages on google.com/search?q=related:http://sub.example.com/board name/
See the FAQs for more info and help, or contact us.
Then... I followed the link to see the robots.txt file.
The robots.txt file:
User-agent: *
Disallow: /search
Disallow: /groups
Disallow: /images
Disallow: /catalogs
Disallow: /catalog_list
Disallow: /news
Disallow: /addurl/image?
Disallow: /pagead/
Disallow: /relpage/
Disallow: /sorry/
Disallow: /imgres
Disallow: /keyword/
Disallow: /u/
Disallow: /univ/
Disallow: /cobrand
Disallow: /custom
Disallow: /advanced_group_search
Disallow: /advanced_search
Disallow: /googlesite
Disallow: /preferences
Disallow: /setprefs
Disallow: /swr
Disallow: /url
Disallow: /wml
Disallow: /hws
Disallow: /bsd?
Disallow: /linux?
Disallow: /mac?
Disallow: /microsoft?
Disallow: /unclesam?
Disallow: /answers/search?q=
Disallow: /local?
Disallow: /local_url
Disallow: /froogle?
Disallow: /froogle_
Disallow: /print?
Disallow: /scholar?
Disallow: /palm
Disallow: /complete
Disallow: /sponsoredlinks
Disallow: /videosearch?
Disallow: /videopreview?
Disallow: /videoprograminfo?
Disallow: /maps?
So... how did they do it? And what are some of the odd paths they seem to have excluded?
Thank you so much!
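For anyone reading along, here is a rough sketch of how Disallow lines like the ones in the listing above are usually interpreted, using Python's standard urllib.robotparser module. Only three rules are copied from the listing. The bot name "ExampleBot", the helper is_allowed, the test paths, and the BLOCK_ALL rule set are my own illustrations and not something taken from the forum in question. It is only meant to show that each Disallow value is matched as a path prefix, and that "Disallow: /" is the conventional way to ask all compliant crawlers to stay out of a whole site.

```python
from urllib.robotparser import RobotFileParser

# Three rules copied from the listing above; the real file is much longer.
SAMPLE_RULES = """\
User-agent: *
Disallow: /search
Disallow: /groups
Disallow: /images
"""

# Illustrative only: the conventional "keep every compliant crawler out" file.
BLOCK_ALL = """\
User-agent: *
Disallow: /
"""


def is_allowed(path, rules=SAMPLE_RULES, agent="ExampleBot"):
    """Return True if a crawler named `agent` may fetch `path` under `rules`.

    `agent` is a made-up bot name; any name falls under the "*" group here.
    """
    parser = RobotFileParser()
    parser.parse(rules.splitlines())
    return parser.can_fetch(agent, path)


if __name__ == "__main__":
    for path in ("/search?q=test", "/searching", "/groups/foo", "/about"):
        print(path, "allowed" if is_allowed(path) else "blocked")
    # With BLOCK_ALL every path is refused.
    print("/index.php", "allowed" if is_allowed("/index.php", BLOCK_ALL) else "blocked")
```

Note that robots.txt is only a request that well-behaved crawlers honour, and it has to sit at the root of the host name, so whether it can be used on a hosted board depends on whether the host lets you place a file there.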
[edited by: ThomasB at 7:31 am (utc) on Sep. 29, 2005]
[edit reason] examplified [/edit]