Forum Moderators: phranque

Message Too Old, No Replies

Duplicate pages within my site

Looking for guidence on solutions

         

stuartmcdonald

11:45 pm on Oct 7, 2004 (gmt 0)

10+ Year Member



My site has a substantital number of pages with identical content on different URLs, and Google isn't indexing either version (there could be other reasons, but duplicate content is the most obvious).

The reason I have duplicate stuff is because I'm using URL rewriting to serve db content and so the structure allows for some content to be served twice at different addresses.

It's a travel guide site (url in profile) and the problem centres around the accommodation listings portion. Users can get a list of accommodation catergorised by price (cheap, mid range, expensive, all, etc) and from the respective lists they can get a detailed page on the particular guesthouse. The problem revolves around the "all" page (which lists all guesthouses) as if a user requests a detailed profile from the all page, the profile page is identical to the served if they had selected the property from the cheap price range, but the URL is different.

Hence the problem.

At the moment, I'm thinking of adding a no-index command on all the "all" summary pages, thus only the page coming off from the price bracket will appear to Google, but was wondering if anyone has any better suggestions.

Hope the above makes sense.

Thanks in advance

trillianjedi

10:51 am on Oct 8, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I don't actually think that duplicate content is likely to be your problem here, but if you want to at least eradicate it as a possibility can you put a directory in any DB URL's and use robots.txt to stop the bots indexing those non-rewritten links?

TJ

stuartmcdonald

11:09 am on Oct 8, 2004 (gmt 0)

10+ Year Member



Not really as all the db urls are in the root directory, and there's at least 500+ of them - they grow each time we add another place!

You suggest they may not be the problem - I thought duplicate content was the most common cause of pages listed in Google without description (ie they're not in the index)

trillianjedi

11:11 am on Oct 8, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



No - usually only one of them is indexed, or they are both indexed and only one of them ranks.

TJ

stuartmcdonald

11:17 am on Oct 8, 2004 (gmt 0)

10+ Year Member



Ahh ok, thanks for that