Forum Moderators: Robert Charlton & goodroi
With MC's blog, it seems to be an issue of cloaking and of who owns the original content, rather than of picking the wrong canonical/main URL for the site.
Picking the wrong URL means crawling and ranking problems, which means you can't outrank sites that may have stolen your content (rather than sites stealing your content leading to a downranking).
In other words, picking the wrong URL leads to the downranking, not the stolen content. The wrong URL can get picked because external links go to the wrong place (maybe via a 302 - well, definitely in the past), because too many internal links go to the wrong place, etc. - but not because of stolen content (IMO).
The update is finally over, phew. Results from this DC: [216.239.37.104...] will be the final ones, except that, for one, new sites that were previously "sandboxed" or had other tech problems (which account for a small minority of the total sites indexed) will be added to the index. This will cause some minor shuffling, but pretty much, the fat lady has sung. At least in my neck of the Internet woods.
<edit - by "over", I mean in the sense that I can stop watching it, since I now already know what will happen>
[edited by: spaceylacie at 6:56 pm (utc) on Nov. 2, 2005]
Perfection: If your site is perfectly OK for Google, doing nothing for years would be fine - never touch a running system ;-) But maybe there are some issues, like duplicate content, that Google has only detected now. WW is full of posts about old sites that unexpectedly vanished from the index. (Mine has been hit too.)
Taps, exactly. The site was doing what I wanted it to do for almost 3 years, so I had no reason to add or change anything.
But about duplicate content... the only issue there is that countless websites (some quite spammy) steal from my site word for word. I always figured this wouldn't cause me any problems as far as rankings go (and it hasn't for 3 years now), but is this what you mean by dup. content? Or are you referring to something else?
Also, this site hasn't vanished on the new DCs. All of the pages are still there, just pushed back an insane amount (3-10 pages) on literally every single search term. These are the same terms I was previously in the top 10 for, for years.
Do a site:domain.com -www search - any entries? E.g. your homepage?
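If you'd rather check a list of URLs than eyeball the results, here is a rough Python sketch of the same idea. It assumes you have already collected the indexed URLs by hand (it does not query Google), and the function name and the www.domain.com default are just placeholders for illustration:

```python
from urllib.parse import urlsplit

def split_by_host(urls, canonical_host="www.domain.com"):
    """Separate URLs on the canonical host from URLs on any other host."""
    on_canonical, off_canonical = [], []
    for url in urls:
        # urlsplit only recognises the host when a scheme is present,
        # so add one to bare entries like "domain.com/page.html".
        if "://" not in url:
            url = "http://" + url
        host = (urlsplit(url).hostname or "").lower()
        if host == canonical_host:
            on_canonical.append(url)
        else:
            off_canonical.append(url)
    return on_canonical, off_canonical
```

Anything that lands in the second list (for example a non-www homepage) is a URL that is indexed under a host other than the one you want.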
So the fact that such pages held high rankings for so long indicates that things were broken before. Removal of such sites is actually an indicator of improvement.
Unless your site is based on ancient history.
Internal dupe content is easier to fix. Check your site as Dayo suggested. Also, if you use PHP or another language, check whether your content can be accessed in different ways, for example:
www.widget.com/show_article.php?id=0
and
www.widget.com/another_filename_for_this_article_id0.html
After that check for print versions of your articles...
There's always some dupe content to be found ;-)
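A quick way to hunt for this kind of internal dupe is to fingerprint the text of each page and group the URLs that share a fingerprint. The sketch below is only an illustration: it assumes you have already fetched the page text yourself into a dict, the function names are made up, and it only catches exact matches after whitespace and case are normalised. It says nothing about how Google itself detects duplicates.

```python
import hashlib
import re
from collections import defaultdict

def fingerprint(text):
    # Collapse whitespace and lowercase so trivial formatting
    # differences do not hide otherwise identical content.
    normalized = re.sub(r"\s+", " ", text).strip().lower()
    return hashlib.sha1(normalized.encode("utf-8")).hexdigest()

def find_duplicates(pages):
    """pages maps URL -> page text; returns groups of URLs with identical text."""
    groups = defaultdict(list)
    for url, text in pages.items():
        groups[fingerprint(text)].append(url)
    # Only groups with more than one URL are duplicates.
    return [sorted(urls) for urls in groups.values() if len(urls) > 1]
```

Feed it the article text from show_article.php?id=0, the .html version, and any print version; every group it returns is a set of URLs that should probably be reduced to one.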