I have to deal with "SPIP", the French CMS used by the company I work for. It gives me a lot of headaches when it comes to SEO: URLs with ampersands and question marks, no folder hierarchy, and so on.
My main concern is duplicate content: the CMS duplicates every page with a printer-friendly version, a send-to-a-friend version, and so on.
There is another (severe?) problem. Let us say I have a page at http://www.example.com/part.php3?id_secteur=1.
To my astonishment, I noticed that when I change the URL to http://www.example.com/part.php3?id_secteur=2,
http://www.example.com/part.php3?id_secteur=3, and so on,
the same content is served! I don't need those countless versions of the same page. The only one I link to is number 1. I suppose numbers 2 and 3 will never be considered by Googlebot, but I'm scared anyway.
What can I do?
I was thinking of using the robots.txt file extensively to exclude all folders with duplicate content. Or is there another way to proceed?
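To make the robots.txt idea concrete, here is a rough Python sketch of how I could test a draft rule set before uploading it. It uses the standard library's urllib.robotparser. The script names /print.php3 and /send.php3 are placeholders I made up for the printer-friendly and send-to-a-friend versions, not the real SPIP names, and the helper functions are only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Draft rules. ASSUMPTION: /print.php3 and /send.php3 are hypothetical
# placeholder names for the duplicate versions, not SPIP's real scripts.
ROBOTS_TXT = """\
User-agent: *
Disallow: /print.php3
Disallow: /send.php3
"""


def build_parser(robots_txt: str) -> RobotFileParser:
    """Parse robots.txt text directly, without fetching it over HTTP."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


def is_crawlable(parser: RobotFileParser, url: str,
                 user_agent: str = "Googlebot") -> bool:
    """Return True if the draft rules let user_agent fetch url."""
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    parser = build_parser(ROBOTS_TXT)
    for url in (
        "http://www.example.com/part.php3?id_secteur=1",
        "http://www.example.com/print.php3?id_secteur=1",
        "http://www.example.com/send.php3?id_secteur=1",
    ):
        verdict = "allowed" if is_crawlable(parser, url) else "blocked"
        print(url, "->", verdict)
```

Disallow rules are matched as prefixes of the URL path, so a single line per script should cover every query-string variant of it. This only checks the rules against Python's parser; real crawlers may interpret edge cases differently.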
(The site is an authority site in its sector, has 2,000 unique content pages and more than a thousand inbound links from quality sites, yet it gets no more than 500 unique visitors a day!)
Thanks in advance for the excellent advice that is so common on WebmasterWorld!
[edited by: jatar_k at 6:37 am (utc) on June 22, 2006]
[edit reason] examplified [/edit]