Forum Moderators: open
Why not add to the page or make it different. Add some value to the page, above and below the dup' cont' and it will rank.
I am not sure of the %age difference required but it cannot be that great, I see the wiki scrapers etc ranking well!
Unless you have loads of dup content then in my experience just the page gets dropped
I'd second that. The pages are still in the SERPs, but they're listed as URLs only.
There does seem to be a threshold of duplication where a whole site can be penalized though, regardless of whether it has a small amount of unique content on or not.
What kind of steps? Not linking the pages into your site structure; rewriting requests for dynamic URLs to the static URLs; blocking spider access with either robots.txt or robots meta tags. Any one of the options would do the trick.
In fact the texts were rather long so I broke them down into bite-size chunks (with "next" and "previous" links), so they represent a lot more pages per text than do the corresponding sets of pages of my opponents. Whether this helped to reduce the duplication as seen by Google, I can't say for sure but I doubt if Google is too concerned about this sort of thing. I believe the risky sort of duplicate content is where pages are more or less identical in all respects, giving rise to the spam suspicion.
Doing a Google search for a piece of one of my classic texts - search without speechmarks - my page is #1 out of 33,700. Search with speechmarks and Google returns only two results (my page not returned) but with an option to repeat the search with the omitted results included. This returns 48 pages on various sites that contain that classic text - my page is #4 out of 48.
Purely anecdotal, but circumstantially suggests that this is not seen as duplicate content of the spam variety, ie which might hurt a site more generally.