Forum Moderators: Robert Charlton & goodroi

Message Too Old, No Replies

Duplicate content vs Duplicate code

Which is triggered by which

         

photopassjapan

12:45 pm on Sep 12, 2006 (gmt 0)

10+ Year Member




For example the fact that i've been pointed out on webmasterworld:

Having the same META DESCRIPTION tag on more than one pages will push all except ( the highest level ) one to the omitted results page for being "too similar"... completely regardless of its content, title, size and whatnot.

I've checked and a single word of difference seemed enough for Google to believe that the page isn't that similar after all. Not that i've experimented with it, only it was obvious once i checked the results.

Furthermore it's not some dreadful flag, for the pages ARE indexed in Google, and if you do a search that's relevant only to such a "lower level" omitted page... it does come up. It comes up when the higher level page is no longer relevant.

I'm puzzled, just what IS dupe content? If the triggers for duplicate content can't handle different code... nor can the duplicate code filter handle a single word of difference. I did a search on an essay i wrote for a site some time ago... and found it on three different corporate websites as their "mission" or their about us page... ( No kidding. Talk about lack of self-awareness or creativity. ) My text was basically copy/pasted into a page with some different code around it, and from that point it wasn't dupe content i guess. Their PR wasn't 0, they were indexed in Google.

...But i do have an exact question this time as well.
This might be interesting for us for we have thousands of pages for each larger pic on our site, the title and content is unique, the description is not, and we need to find a way to make it so. But we don't have the resources to write thousands of unique descriptions AGAIN that would be different from the title, this is a non-commercial site.

The question is...

If the page meta description is unique and relevant to the page... but it's the same as the page TITLE... that's probably spam, right?

But... if the page TITLE is completely unique to all pages... and we'd like it to be featured in its whole BUT with some additional information in the META DESC tag...

Is that a no-no as well? We've had enough trouble already and the least i'd want when trying to reach SOME more people is to be banned by some algo for overdozing on relevancy :-P

I mean are pages being moved to supplemental results / omitted results / out of the index because of such a thing?

g1smd

8:34 pm on Sep 12, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



There are many types of duplicate content. I wrote about this at length some months ago. Check those posts for additional information.

There are the "exact duplicates", due to poor site design:
- same content at URLs with multiple different parameters (use meta robots noindex on all but one version),
... eg: /shirts.php?size=16&colour=blue vs. /shirts.php?colour=blue&size=16
- same content at www and non-www (use 301 redirect to fix)
- same content at multiple domains (use 301 redirect to canonical domain)
- http vs. https (use a 301 redirect or 404 on https to fix this)

.

There are "pseudo-duplicates" caused by have the same title and/or same meta description on multiple pages.

These are easy to fix. Make sure that every title tag and every meta description is unique.

The indication that you have this problem is seeing the "click for omitted results" link early on in a site:domain.com search (e.g. "Results 1 to 4 of about 750 pages").

.

There are "syndication duplicates", where the same news article is posted to dozens or hundreds of sites.

Since the navigation code, and the page filename, and the site structure, and the on-page code is different, then this is not a true duplicate and Google will happily list multiple copies, but they do filter at least some of them out.

The exact filtering criteria is completely unknown.

g1smd

11:08 am on Sep 13, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



"Exact duplicates" also include "variable case URLs" as allowed by IIS for example.
e.g. /ThisFile.html vs. /thisfile.html

Apache is totally immune to that particular problem.

glengara

11:23 am on Sep 13, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



*.. if the page TITLE is completely unique to all pages... and we'd like it to be featured in its whole BUT with some additional information in the META DESC tag...*

Sounds safe enough to me, particularly if you're not repeating KWs in the individual tags...

photopassjapan

3:32 pm on Sep 13, 2006 (gmt 0)

10+ Year Member



Thanks, i guess it's pretty much covered then...

mbucks

6:01 pm on Sep 13, 2006 (gmt 0)

10+ Year Member



Just one question on this. If the meta description and title on the page are the same, is google likely to punish you for it? It sounds like a pretty good way of telling the search engine what it's about. Especially on large dynamic sites.

g1smd

6:40 pm on Sep 13, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



They can be the same, but I would expect the description to be about 3 or 4 times longer than the title.

mbucks

7:40 pm on Sep 13, 2006 (gmt 0)

10+ Year Member



Yep, makes sense!

How about if the meta description is a short snippet from the start of the content on the site.

Though prob not the best way of utilising the description function, but a sure-fire way of making each page different.

g1smd

8:15 pm on Sep 13, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



That would probbaly work in the short term.

In the long term I would want a "high quality" meta desciption and hence snippet, as that is what searchers first see in the search results.