aakk9999 - 2:13 am on Jan 24, 2011 (gmt 0)
how a page can rank well for years and then suddenly be displaced from the SERP as a "duplicate" of a new page on a scraper site.
I was wondering too. On one of personal websites I wrote a blog post on some price comparisons on some general items between countries. It was an original post where we decided on a basket of products and actually went to the shop with a list and pencil and researched prices. Wrote an article that was ranking #1 for two years until someone scraped it to Yahoo answers. They have not changed that article, it is word-for-word copy. And this scraped page replaced our page as #1.
I can understand Google issues where a site publishes article and it gets scraped almost immediately. Where the timeframe between original and scraped is short, it is difficult to tell who is the originator as it may depend on how often the site is crawled e.g. if a site that is crawled every few minutes scrapes someone's content, and the originating site is crawled twice per week - there is no way Google will able able to tell with certainty who is the originator.
But for article that was there 2 years and then scraped - this is obvious, isn't it?
The only way I can see this being solved is if Google provides some kind of "ping" service where you ping the site with new URL before you interlink it to your site. In that way Google can get to it, but no-one else knows about it (yet) and you interlink URL after x amount of time.
But as I am writing this, I can see this is too complicated and does not solve the issue of changed content on already interlinked (known) URLs.