TheOptimizationIdiot - 4:12 am on Feb 27, 2013 (gmt 0)
It's just that I don't see Google inflicting a true penalty on a site just because there's an exact match between H1 and title.
Do you think it could be something more along the lines of a combination of things and the H1 is just the first thing people change when adjusting other aspects of the phrase based spam detection could have the same impact?
 In the first section (first m entries), the following relevance attributes are stored for each document entry in the posting list of a given phrase:  1. The document relevance score (e.g., page rank);  2. Total number of occurrences of the phrase in the document;  3. A rank ordered list of up to 10,000 anchor documents that also contain the phrase and which point to this document, and for each anchor document its relevance score (e.g., page rank), and the anchor text itself; and  4. The position of each phrase occurrence, and for each occurrence, a set of flags indicating whether the occurrence is a title, bold, a heading, in a URL, in the body, in a sidebar, in a footer, in an advertisement, capitalized, or in some other type of HTML markup.
 M(p): Number of interesting instances of the possible phrase. An instance of a possible phrase is "interesting" where the possible phrase is distinguished from neighboring content in the document by grammatical or format markers, for example by being in boldface, or underline, or as anchor text in a hyperlink, or in quotation marks. These (and other) distinguishing appearances are indicated by various HTML markup language tags and grammatical markers. These statistics are maintained for a phrase when it is placed on the good phrase list 208.