Forum Moderators: open
The newsletter will usually include net-new content, but some of it may appear on other parts of my site (i.e. the normal articles area).
Will I be smacked down for duplicate content? The newsletter archives present the whole newsletter (including featured articles) all on one page,just like in email form, whereas the article engine has a separate page for each article.
Please advise, thx much.
shasan
I thought of many algos but still banning a site for duplicate content is tough. But google is a big boss with a big algo, I still doubt whether google penalise for duplicate content.
As duplicate content has no proper definition.
Aji
I just want my newsletter archives available to people, and the archives may contain items published (previously) on other parts of my website. Would that stand up to a 'handcheck'? Should I even worry about it?
<grammar edit>
It is unclear how Google's algo decides which page to prefer. We have had quite a few discussions about that here at WW. It seems that age will often be a considerable factor but not necessarily the one determinant factor.
I personally know of at least two articles that have been published by their authors on the web in two different places. In one case the oldest version ranks high in Google while the newer one is literally invisible. In the other case it's the other way around.
The algo does not distinguish between stolen content and legit content. That will have to be an individual decision made by a human.
Having added a hidden div (popup help) on a page, I decided to play ultra-safe and added a robots meta tag to exclude it from Google's index. There is a 99% chance that Google would not have worried about this as hidden text, however, the page was unimportant (for search engines) so caution seemed sensible.
If you have duplicate content that you don't want/need indexed, just exclude it by one of the methods above.
Kaled.
Only Google know. And there is no doubt that Google constantly are refining their filters so that they catch as much duplicate content as possible while on the other hand avoid to suppress genuine content. This may mean that if you write a long article that has a rather long quote from another article, that quote and only that will be filtered as duplicate content and downgraded in the SERPs.
The newsletter will usually include net-new content, but some of it may appear on other parts of my site (i.e. the normal articles area).
I have a similar situation where one page lists many items and other pages list one of those items and when I search for some keywords from the text I find them both listed, general page first, detail page later, perhaps because that one has a longer uri or some of the keywords appear more often of the general page.
One time i'm in the top5 with 18 total results another time the detail page isn't considered relevant even though there are only 2 relevant results, of 4 found total.
Thus it's hard to understand what is going on precisely..
I don't get the impression that there is a penalty issue going on though..
Max