However, I've noticed in some niches (real estate and car reviews) that the top results are often page after page of duplicated content. Is Google not able to spot this and flag it as duplicate?
Or is the issue not really duplicate content at all, but rather that for those search queries there aren't other, more competitive pages to take the place of the duplicates in the search results?
That may be another way of asking whether Google really does much at all about duplicate content. As I said, some niches are full of duplicate pages in the first ten spots for certain phrases.
Why can't Google catch this, if it can detect it at all?
It sometimes feels like Google takes off the restrictions in some markets and just allows those domains to have a no-holds-barred steel cage death match. I've heard this opinion expressed frequently, but never officially or with any real authority.
These are my names for them, and I think they express what they are quite well.
- "exact duplicates" - these are www vs. non-www, multiple domains, similar parameters, capitalisation issues, http vs. https, etc. Some get delisted, some turn URL-only, and some show as Supplemental.
- "pseudo duplicates" - these are where many pages on the same site have the same title tag and/or the same meta description as other pages (even though the page content itself might be very different). Most of these get hidden behind a "click for omitted results" link, and some might get dumped to Supplemental.
- "syndicated content" and "site scrapers" - these are where the domains are owned by other people and the content is not an exact byte-for-byte copy. The site navigation may well be different, and the page HTML code is likely to be different too. For these, Google might list quite a few before deciding to filter some out. You see this with press releases and newswire stuff. It's interesting to see what sinks and what swims. In some areas I don't think they apply a heavy enough filter for these - as you have noticed.
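To make the difference between the first and third categories concrete, here is a rough Python sketch. It is only an illustration of the general idea, not how Google does it (nobody outside Google knows that). The function names `canonical_key`, `shingles` and `jaccard` are made up for this post, and the choices to ignore the query string and to lowercase the path are assumptions of mine that will not suit every site.

```python
import re
from urllib.parse import urlsplit


def canonical_key(url: str) -> str:
    """Collapse common "exact duplicate" URL variants to a single key.

    Assumptions (hypothetical rules, adjust for a real site): the scheme is
    ignored, a leading "www." is dropped, the path is lowercased, a trailing
    slash is removed, and the query string is discarded entirely.
    """
    parts = urlsplit(url.strip())
    host = (parts.hostname or "").lower()
    if host.startswith("www."):
        host = host[4:]
    path = parts.path.lower().rstrip("/") or "/"
    return host + path


def shingles(text: str, k: int = 3) -> set:
    """Return the set of k-word shingles (overlapping word runs) in text."""
    words = re.findall(r"[a-z0-9]+", text.lower())
    if len(words) < k:
        # Text shorter than k words becomes one shingle (or none if empty).
        return {tuple(words)} if words else set()
    return {tuple(words[i:i + k]) for i in range(len(words) - k + 1)}


def jaccard(a: str, b: str, k: int = 3) -> float:
    """Jaccard similarity of two texts' shingle sets: 1.0 same, 0.0 disjoint."""
    sa, sb = shingles(a, k), shingles(b, k)
    if not sa and not sb:
        return 1.0
    return len(sa & sb) / len(sa | sb)


if __name__ == "__main__":
    # Two "exact duplicate" URL variants map to the same key.
    print(canonical_key("http://www.Example.com/Page/"))
    print(canonical_key("https://example.com/page"))
    # A lightly rewritten copy scores well below 1.0 but well above 0.0.
    print(jaccard("the quick brown fox jumps", "the quick brown fox leaps"))
```

The point of the sketch: exact duplicates can be caught with simple URL rules, whereas syndicated or scraped copies only show up as a similarity score, so any filter has to pick a threshold. Where that threshold sits would decide how many near-copies survive in the results.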