Forum Moderators: goodroi
I want search engines to index each message thread but unfortunatly there are 'edit' and 'delete' links on each message and Google has followed each of these and indexed each of the page it throws up - this is using huge bandwith for no reason.
So i thought i could use the robots.txt to stop it but i am new to robots.txt so dont really know how.
The message board pages i want indexed have the URL:
[www.mydomain.com/cgi-bin/mb/mesage_number.html]
when you click an edit function you get the URL:
[www.mydomain.com/cgi-bin/mb/edit.pl?query_string]
Is there a way i can stop indexing of all the edit pages easily?
Another way i have thought was to add something in the <head> tag to stop search engines following links off each of these pages. Is there such a command?
All messages have a master template so it would be simple to add. However this would only cover new messages not all the old ones.
I will report back how i get on, to give an idea of the scale the are about 400 legitimate pages on my site including message threads - Goolge currently reports over 2000 pages. SO at least 1600 of them are simply it following the edit and delete links on each message in every thread and indexing that page.
If someone is looking at changing the script then i also have another problem with it - the old one used to allow ten messages then start a new page for message 11-20 etc...
This new version (v5.0) after 10 creates a new page for each and every extra message - so #11 on page 2, #12 on page 3 etc...
I currently get around this by having unlimited messages per page but this leads to gigantic page sizes that are not good for users.