Methods and systems for identifying manipulated articles

Abstract - Systems and methods that identify manipulated articles are described. In one embodiment, a search engine implements a method comprising determining at least one cluster comprising a plurality of articles, analyzing signals to determine an overall signal for the cluster, and determining if the articles are manipulated articles based at least in part on the overall signal.

William Slawski at SEO by the SEA has an excellent overview of this new patent that was just granted yesterday (2007-11-27)...

Google Patent on Web Spam, Doorway Pages, and Manipulative Articles
[seobythesea.com...]

The identification of manipulative documents, how they might be grouped together, and how they could be treated by the search engine is described in some detail. That treatment might include removal of pages from the search index, reductions in rankings for pages, and possibly a change in how quality scores (PageRank) are calculated for links from manipulative pages.

It is definitely worth a read and contains some very interesting information. All of Google's Patents contain interesting information. Slawski has a great way of deciphering them and explaining them to his audience. :)

Methods and systems for identifying manipulated articles

Google U.S. Patent Number 7,302,645

pageoneresults

jimbeetle

dibbern2

Join The Conversation

Moderators and Top Contributors

Hot Threads This Week