Forum Moderators: Robert Charlton & goodroi

"Domain Association" in Google.

How does that relate to Duplicate Content issues?

         

g1smd

11:21 pm on Jul 18, 2007 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



For my 6000th WebmasterWorld post. Crikey!

If you have a website that responds with a "200 OK" at both domain.com and at www.domain.com then you have Duplicate Content. In effect you have two sites competing against each other in the SERPs. This has been covered many times before.

You will find that random pages from each "site" are listed in the SERPs, and that many are marked as Supplemental Results. When listing pages using a site: search, you will find that site:www.domain.com and site:domain.com -inurl:www give completely different listings.

Additionally, the Pagerank for domain.com/somepage.html and for www.domain.com/somepage.html are likely to be completely different for almost every sample of somepage, even for the root index page.

Finally, the results for link:www.domain.com and for link:domain.com are also likely to give completely different results too.

These are things that we have known about for years.

.

In the distant past, Google tried hard to "associate" related sites (such as www and non-www) as being "one site".

Some three or four years ago they used to run a process over their database, several times per year, to fix these associations. However, they stopped doing that long ago.

Duplicate Content issues really started to bite around the time Google introduced the Supplemental Index. It was at that point that using a 301 redirect from non-www to www (or vice versa) started to become essential. It made sure that all pages of a site were listed as www (or as non-www if the redirect was reversed).
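The canonicalization that the 301 performs can be sketched in a few lines. This is not from the thread — a minimal Python illustration, with `example.com` / `www.example.com` as placeholder hostnames; a real site would do this in its server configuration.

```python
# A minimal sketch of the non-www to www canonicalization logic that a
# site-wide 301 redirect performs. Hostnames here are placeholders.

from urllib.parse import urlsplit, urlunsplit

CANONICAL_HOST = "www.example.com"   # the hostname you want indexed
ALIAS_HOSTS = {"example.com"}        # hostnames that should 301 to it

def canonicalize(url):
    """Return (status, location): a 301 plus the canonical URL for an
    alias host, or a 200 plus the URL unchanged for the canonical host."""
    parts = urlsplit(url)
    if parts.hostname in ALIAS_HOSTS:
        fixed = parts._replace(netloc=CANONICAL_HOST)
        return 301, urlunsplit(fixed)
    return 200, url

# The alias host is redirected, with path preserved:
print(canonicalize("http://example.com/somepage.html"))
# -> (301, 'http://www.example.com/somepage.html')

# The canonical host is served directly:
print(canonicalize("http://www.example.com/somepage.html"))
# -> (200, 'http://www.example.com/somepage.html')
```

Reversing the redirect (www to non-www) is just a matter of swapping which hostname is canonical; the point is that only one hostname ever answers "200 OK".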

That process has been discussed many times here too.

.

Once Google associates those two "sites" as being the same site, they then list the same backlinks, whether you ask for link:domain.com or for link:www.domain.com. This occurs even though the links only really go to one particular domain, and there are none pointing at the "ghosted" site(s).

.

Additionally, if a site has a .com domain and a .co.uk domain, or any other combination of domains, such as a main site and some common mis-spellings, the same 301 redirects are also required to avoid all of the Duplicate Content issues discussed above.

.

Obviously, once Google makes those additional associations, a request for link:anydomain.com for any of the related domains will give exactly the same list of backlinks, even though all of the links actually only go to one particular domain.

.

There is an interesting question as to how the internal mechanism works when you request the backlinks list for one of the "associated" sites.

For example, when Google associates domain.co.uk with the main site www.domain.com and starts to "list" the same backlinks for both searches in the SERPs, do they:

1. Copy all the data for www.domain.com (the main site) to a separate "file" for domain.co.uk and show that file when link:domain.co.uk is requested, OR

2. When people request data for domain.co.uk simply show the data for www.domain.com (the main site) instead.

The results for two searches are the same; but are you looking at a separate copy of the data, or are you just being redirected to the original backlinks list as held for the main site?
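The copy-versus-redirect question can be made concrete with two toy data models. This is a hypothetical illustration, not a claim about Google's actual internals; all domain names and backlink entries are invented.

```python
# Two toy models of how "associated" domains might share one backlink
# list. All names and data are hypothetical illustrations.

backlinks = {"www.example.com": ["site-a.com/page", "site-b.org/links"]}

# Model 1: copy the data into a separate record for the associated domain.
copied = dict(backlinks)
copied["example.co.uk"] = list(backlinks["www.example.com"])

# Model 2: keep only an alias table and resolve it at query time.
aliases = {"example.co.uk": "www.example.com"}

def lookup(domain):
    """Resolve any alias, then return that domain's backlink list."""
    return backlinks.get(aliases.get(domain, domain), [])

# Both models answer identically while the association exists...
assert copied["example.co.uk"] == lookup("example.co.uk")

# ...but they diverge the moment the association is removed:
del aliases["example.co.uk"]
print(lookup("example.co.uk"))    # model 2 breaks instantly: []
print(copied["example.co.uk"])    # model 1 keeps serving the stale copy
```

That divergence is exactly what the next point is about: a stale copy would keep showing the "main site" backlinks after the 301 is removed, whereas a query-time alias would break as soon as the association is dropped.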

.

This also has important repercussions as to how backlinks listed for the associated sites are treated if the 301 redirect that completes the association is ever removed.

For a while, Google will be showing an incorrect backlink list, which may lead to some confusion if you weren't aware that a redirect had previously been in place.

How long does it take Google to realise that the redirect has gone?

How long does it take for the backlink list to be recompiled for the "associated" site, or for the association to the "main site" backlink list to be broken?

Even if you see something listed in the public SERPs, I still don't believe that it is always the same data that Google is using internally, or that a site is necessarily getting the benefit you think it is from what you think you see.

.

Discuss.

g1smd

3:56 pm on Jul 19, 2007 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



OK. It went kinda quiet all of a sudden.

Is this one too difficult, too obvious, too technical, too controversial, too political or what?

I've been told, elsewhere, that I'm talking rubbish with this one.

rainborick

4:28 pm on Jul 19, 2007 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



I've kept an eye on people experiencing the www. problem for several years now, and I still see evidence of Google eventually canonicalizing these URLs for indexing and ranking purposes. So I was surprised to see you say they stopped doing it. I'll be keeping a closer watch on it.

What I have also seen Google doing is modifying the special operators in ways that make them less accurate for dissecting SEO issues. That is, a couple of years ago, if I saw weird results from site: and link: I'd be much more inclined to consider them reliable indicators of trouble than I am today. Today, I'd use them only to justify further investigation. This has been true since Google's Big Daddy infrastructure changes, which I think limited those special operators to methods designed either to enhance their speed or to limit their impact on the system. Ultimately, it has all made them less accurate for SEO issues like this.

CainIV

5:34 pm on Jul 19, 2007 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Interesting post.

Here is a case study that recently happened which might shed some light.

A client of mine had a website that was rebuilt from non-friendly URLs to friendly URLs. The development was done without further consultation on my part.

Previously, I had had the development team implement a site-wide 301 redirect from non-www to www. The site was ranking well.

Four weeks ago, during testing and implementation, the development team forgot to re-implement the 301. Coming back from holidays, to my surprise, the redirect was gone.

Luckily, I checked the website when I got back and asked them to put the 301 back immediately.

Using site:, the website was now a mess, where previously it was fine. There were unequal numbers of non-www URLs and www URLs in the query results.

Here are my observations from looking at server log traffic, hits etc:

1. It took only about one week for Google to index the incorrect URLs. Days later, the non-www and www URLs were both listed.

2. Within the next update of the SERPs, the site went from the second page to not in the top 1000 for its search terms.

3. Putting the 301 back has caused a jump of about 100 positions per day back towards where the site previously sat in the SERPs.

This doesn't explain how Google counts backlinks in queries for each 'page', but it does show that Google certainly views them as different things, and that with the 301 in place it can (within a week, give or take, on a well-linked site) associate backlinks with the correct pages.

Drew_Black

1:43 am on Jul 21, 2007 (gmt 0)

10+ Year Member



Isn't setting the preferred domain in Webmaster Tools supposed to take care of the www/non-www issue?

CainIV

3:18 am on Jul 21, 2007 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



No, that setting only tells Google how you prefer the site to be shown. You need the 301 redirect to make sure nothing is left to chance.

g1smd

7:05 pm on Jul 21, 2007 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Anyone want to discuss the "Domain Association" part of my post, and subsequent information?

Marcia

9:06 pm on Jul 21, 2007 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Going back to inclusion in the "anchors file" in the original documents (if that's still valid), the question is how that's handled in the case of associated domains or TLDs.

Miamacs

9:43 pm on Jul 21, 2007 (gmt 0)

10+ Year Member



...

DISCLAIMER: My brain is over ten degrees above its nominal temperature. It's far too hot for me to think properly, so expect some redundancy.

But it's interesting so I gave it a shot.

I say it's a redirected query, not a copy of the profile.
Meaning that when asking for data on a domain that has been merged into another, I get the data for the recipient side, and that's that (though only for domain-to-domain redirects). And in the background, Google keeps track of the now-redirected domain too.

The final destination of the 301s slowly picks up all the data, which is recorded to its profile one by one as Google crawls the web (as long as the redirects are in place), but the redirected domains' profiles are frozen from day one, and queries for them are forwarded to their target's data (which will soon include theirs, but not vice versa).

From the perspective of links, the new site will not get them copied to its profile either. It seems to me that Google picks them up one by one, simply skipping the in-between domain, and adds them to the final destination's profile. Until it does, these links sit in the now-inaccessible profile of the redirected domain, but since the page listings are turned off (the site: command won't show old pages), they don't really matter. Once a link is picked up, it is (obviously) erased from that list, as it's taken for granted that it now has a new target... and it can't have two. So there's a temporary blackout.

Should you pull the redirect in the midst of the process, it will not negate the effect in an instant, but will reverse the above practice, with the target domain losing and the old domain re-gaining the links as Google finds out that they have a new target (yet again). Except that with every such move your TrustRank clock gets reset. You lose all the benefits of link age, which is... need I explain? A bad move.
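The one-by-one pickup and its reversal described above can be sketched as a tiny simulation. This is purely illustrative of the claimed behaviour, not of Google's actual mechanism; the profiles and link names are invented.

```python
# Toy simulation of links being re-attributed one crawl at a time from
# a redirected domain's frozen profile to the redirect target's profile,
# and of the process reversing when the redirect is pulled.
# All profile contents are hypothetical.

old_profile = ["link1", "link2", "link3"]   # frozen when the 301 goes up
new_profile = []                            # the target gains links slowly

def crawl_step(redirect_in_place=True):
    """Move one link between profiles as Googlebot re-finds it."""
    if redirect_in_place and old_profile:
        new_profile.append(old_profile.pop(0))
    elif not redirect_in_place and new_profile:
        old_profile.append(new_profile.pop(0))

crawl_step()
crawl_step()                          # two crawls with the 301 in place
print(old_profile, new_profile)       # partway through: the "blackout"

crawl_step(redirect_in_place=False)   # redirect pulled: process reverses
print(old_profile, new_profile)       # old domain starts re-gaining links
```

The point of the sketch: at no moment does a link live in both profiles, which matches the "they can't have two targets" observation, and pulling the redirect mid-process just runs the same migration in the other direction.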

If you look closely at the SERPs for the info:example.co.uk domain (which, let's say, has been redirected to example.com), you'll notice that soon after the full domain redirect is picked up (root-level, full-scale redirect), Google will list the options for the domain as follows:

query is info:example.co.uk

This will list the domain like this:

TITLE of EXAMPLE.COM
The regular snippet found for example.com, let's say
the META description tag, nice and tidy.
www.example.com/

Show Google's cache of example.co.uk - (is really a link to cache:www.example.com)
Find web pages that are similar to example.co.uk - (is really a link to related:www.example.com)
Find web pages that link to example.co.uk - (is really a link to link:www.example.com)
Find web pages from the site example.co.uk - (this however really is site:example.co.uk, 0 results)
Find web pages that contain the term "example.co.uk"

...

From the time Google learns of the domain redirect...
The link: command will show 0 results.
the site: command will show 0 results.

The data in their profile seems to fade away at a similar pace to when Google can't reach a page for a longer period. Technically, it can't reach them. The difference is that while Googlebot will try to crawl pages again and again, it will do so on the new domain. Links it can follow will be recorded, and links it can't will be reported, but for the new domain. Which means one link less for the old one, every time. If the redirect is lifted, the same thing occurs, only in reverse: it will drop the links it recorded for the new domain and start accessing them at the old location. But it won't look for pages on the resurrected domain that only the new domain had.

...

Google seems to think differently of a domain-to-domain redirect and page-by-page redirects. If it sees, at the root level, that example.co.uk is now example.com, with no further rules set, that breaks the ice very fast. After the "merger" is completed, the old domain name will default to the new one, simply skipping the process of evaluating the 301. They just don't seem to care anymore.

...

But if you look at the issue from another angle...

Let's assume that the data for a domain that's now redirected to another (whether it's example.com to www.example.com or example.co.uk to example.com) is a copy of its target, and not just a redirected query in the database.

... isn't that what the 302 hijack was all about?

Put up a redirect, wait until the database copies the profile of the target into the profile of the source page, then pull down the redirect. It sti(n)cked. But that has been fixed now, hasn't it?

...

As for the time it takes for Google to notice that the redirect is gone (or back), it seems to me that the timeframe can be anything from 4 days up to 2 or 3 weeks.

Anyway these are but my observations...

youfoundjake

9:49 pm on Jul 21, 2007 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



g1smd
For example, when Google associates domain.co.uk with the main site www.domain.com and starts to "list" the same backlinks for both searches in the SERPs, do they:

Is that association based on a 301 in place or is it based on how the pages are linked to in the anchor tag?

I don't know, but if it's based on the anchor text, then Google would see it as a completely different domain regardless of the 301, wouldn't they? I may have touched on this in a previous post about the actual domain "example.com" and how there are subdirectories:

[webmasterworld.com...]

Google has a list of URLs for example.com, regardless of whether they exist or not, just because at some point those URLs were referred to in an anchor tag.

As for my site:
no www, 1-184 repeat is 142, total is 142
www 1-185, repeat is 143, total is 143

what is that one page difference? Heeh.

domain.com -inurl:www yields no results for my site.

g1smd

10:29 pm on Jul 21, 2007 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



My example was for a domain that has a site-wide 301 redirect in place to some other domain.

You can take that as .co.uk and .com, or as mis-spellings of the main domain, or whatever you want.