Forum Moderators: goodroi

Trying to add robots.txt to invisionfree forum

can't get it to work

         

Damaris

8:04 pm on Sep 28, 2005 (gmt 0)

10+ Year Member



I followed the directions that I found here and tried to go to "http://s11.invisionfree.com/BOARD NAME/robots.txt"
and got the message below:

Board Does Not Exist.
Make sure you did not mis-type the URL.

How can I add robots.txt to my board? Is there another way to do this? Basically, I want to keep web crawlers off my board and keep cached copies of its pages from appearing anywhere.

Thank you!

Dijkgraaf

10:32 pm on Sep 28, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



robots.txt always has to be in the root directory of the website, e.g.
http://www.example.com/robots.txt
Your board lives in a subdirectory of the provider's host, so I doubt you will be able to use your own robots.txt in your situation.
You would have to ask the provider to add an exclusion to their robots.txt file.
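
To illustrate why the file has to sit at the root: crawlers derive the robots.txt location from the scheme and host only, and ignore whatever path the page is on. Here is a minimal Python sketch of that derivation. The function name `robots_txt_url` and the sample URLs are made up for illustration, not taken from any crawler's code.

```python
from urllib.parse import urlsplit, urlunsplit


def robots_txt_url(page_url: str) -> str:
    """Return the one URL where a crawler would look for robots.txt for this page."""
    parts = urlsplit(page_url)
    # Keep only the scheme and host; the path, query, and fragment are discarded.
    return urlunsplit((parts.scheme, parts.netloc, "/robots.txt", "", ""))


# Hypothetical example URLs: a board in a subdirectory still maps to the host root.
print(robots_txt_url("http://www.example.com/forum/index.php?showtopic=1"))
print(robots_txt_url("http://sub.example.com/boardname/"))
```

Both calls end in `/robots.txt` directly under the host, which is why a file placed at `/boardname/robots.txt` would never be consulted.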

Damaris

11:00 pm on Sep 28, 2005 (gmt 0)

10+ Year Member



Ah phooey. So I would need to ask invisionfree, or buy the domain for my board? I think invisionfree sells them.

But...I got the idea when I was looking at another invisionfree forum. First I got this:

Robots.txt Query Exclusion.

We're sorry, access to [google.com...] has been blocked by the site owner via robots.txt.
Read more about robots.txt
See the site's robots.txt file.
Try another request or click here to search for all pages on google.com/search?q=related:http://sub.example.com/board name/
See the FAQs for more info and help, or contact us.

Then...I went to their log:

Their robots.txt:
User-agent: *
Disallow: /search
Disallow: /groups
Disallow: /images
Disallow: /catalogs
Disallow: /catalog_list
Disallow: /news
Disallow: /addurl/image?
Disallow: /pagead/
Disallow: /relpage/
Disallow: /sorry/
Disallow: /imgres
Disallow: /keyword/
Disallow: /u/
Disallow: /univ/
Disallow: /cobrand
Disallow: /custom
Disallow: /advanced_group_search
Disallow: /advanced_search
Disallow: /googlesite
Disallow: /preferences
Disallow: /setprefs
Disallow: /swr
Disallow: /url
Disallow: /wml
Disallow: /hws
Disallow: /bsd?
Disallow: /linux?
Disallow: /mac?
Disallow: /microsoft?
Disallow: /unclesam?
Disallow: /answers/search?q=
Disallow: /local?
Disallow: /local_url
Disallow: /froogle?
Disallow: /froogle_
Disallow: /print?
Disallow: /scholar?
Disallow: /palm
Disallow: /complete
Disallow: /sponsoredlinks
Disallow: /videosearch?
Disallow: /videopreview?
Disallow: /videoprograminfo?
Disallow: /maps?

Sooo...how did they do it? And what are some of the weird things they seem to have excluded?

Thank you so much!

[edited by: ThomasB at 7:31 am (utc) on Sep. 29, 2005]
[edit reason] examplified [/edit]

Dijkgraaf

12:58 am on Sep 29, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Hmm, good question. Maybe you should ask the owner of that board :-)
Not sure why some of those items are excluded, but they are possibly specific to that board.
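
For what it's worth, each Disallow line in a file like the one quoted above is just a path prefix that matching crawlers are asked not to fetch. Many of the listed paths (/froogle?, /scholar?, /maps?) look like Google services, so the file may actually be google.com's own robots.txt rather than the board's, but that is only a guess. Below is a rough Python sketch that checks URLs against a few of those rules using the standard library's `urllib.robotparser`. The helper `is_allowed`, the user-agent string, and the example.com URLs are assumptions for illustration.

```python
from urllib.robotparser import RobotFileParser

# A few of the rules quoted in the post above (shortened for the example).
RULES = """\
User-agent: *
Disallow: /search
Disallow: /groups
Disallow: /images
"""


def is_allowed(rules: str, user_agent: str, url: str) -> bool:
    """Parse robots.txt text and report whether user_agent may fetch url."""
    parser = RobotFileParser()
    parser.parse(rules.splitlines())
    return parser.can_fetch(user_agent, url)


# "ExampleBot" is a made-up crawler name; it falls under the "User-agent: *" group.
print(is_allowed(RULES, "ExampleBot", "http://www.example.com/search?q=robots"))
print(is_allowed(RULES, "ExampleBot", "http://www.example.com/about"))
```

The first URL starts with the disallowed prefix /search, so the check reports it as blocked, while /about matches no rule and is allowed. Note that robots.txt is only a request: well-behaved crawlers honor it, but nothing enforces it.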

Damaris

1:34 am on Sep 29, 2005 (gmt 0)

10+ Year Member



I can't...*heavy sigh*