Welcome to WebmasterWorld Guest from 22.214.171.124
Forum Moderators: goodroi
"first time caller, frequent listener"
I am little confused about handling pages with duplicate content and to what extent G considers it dupe. I have a ratings site which list specific items. The two paragraph descriptions are in many cases restatements from the businesses/suppliers, thus dupe.
So on which page should the NO FOLLOW NO INDEX be set?
The actual page with the dupe? Does that get the NINF or do the pages with links to THAT page get the NINF?
What do you recommend?
Thanks, "I'll hang us and listen to your response"
Most of the time Google does not penalize you (lower the rankings) for duplicate content. They will filter out duplicate content and when that happens they may not keep the url that you prefer. If you want to be sure your preferred url is indexed in Google then block all the other duplicates of it.
For example lets say these three urls are all showing duplicate content:
You probably would want the cleanest url in Google so you should add the no index tag to print and mobile versions.
Duplicate content within a domain is not very critical. I say this because Google will filter out the duplicates but will leave you with one version in their index. Duplicate content across different domains can be dangerous. This is because Google may filter out your domain and keep another person's domain leaving you with no representation in the search index.
It is the duplicate content across other domains that I'm concerned about. For instance, we have a particular TV model where the colorful description is just a copy of what the manufacturer created. So I have that on my site, and using copyspace?, notice a number of other sites used it. I know for a fact we've gotten black listed already.
The good thing is we are really just coming out of the gate still tweaking things and have not added a lot of content yet. So I would hope that we can start correcting the mistakes on just the few things we have, versus having noticed this after adding tons of content. It is still a lot of work if we change the content we do have, that is why I am asking about the No Index No Follow option.
I know the BEST thing to do is fix the content about 200 items.
For my own edification though, when you know flat out that you have a page which has what I now call 'Nuke Dupes'(you heard it here first), where do you put the No Index No Follow?
I can't help thinking about how all these news and entertainment sites work and get indexed at when most of them are just regurgitating new from AP and PR.