Oliver_Henniges - 8:49 pm on Feb 17, 2007 (gmt 0)
My sphere is the other end: product pages with relatively thin content. Hardly any phrase related to the term "widget" is likely to occur on such a page at all. Which definitely is not the case on most of my product pages. Nevertheless, my pages still do fine in the SERPs (though I admit it's a niche market). So all in all, this is evidence that the spam detection patent, at least, is quite likely applied only to a narrowly selected set of phrases. That is only natural: it is limited to spammy phrases. But for me the question remains open to what extent phrase-based IR can be applied to such an enormous number of words and their possible combinations in an environment like the internet.
> ...because I'm working with articles. With them there is usually a good variety of related phrases... So if you have a bunch of different "good phrases" on a page, but the page does not contain some of the other words and phrases that the "good phrase" is indicative of, a filter catches it.
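To make the quoted idea concrete, here is a rough Python sketch of such a filter. It is only an illustration under my own assumptions, not what the patent or any search engine actually does: the RELATED table, the function names and both thresholds are made up, and real phrase data would be learned from co-occurrence statistics rather than typed in by hand.

```python
# Hypothetical table: each "good phrase" maps to related phrases it is
# assumed to be indicative of. A real system would derive this from data.
RELATED = {
    "blue widget": {"widget size", "widget price", "widget installation"},
    "widget repair": {"replacement part", "widget warranty", "service manual"},
}


def related_coverage(text, good_phrase, related=RELATED):
    """Fraction of a good phrase's related phrases that occur in the text."""
    text = text.lower()
    expected = related[good_phrase]
    found = sum(1 for phrase in expected if phrase in text)
    return found / len(expected)


def looks_suspicious(text, min_good=2, min_coverage=0.34):
    """Flag a page that has several good phrases but few of their related phrases.

    min_good and min_coverage are arbitrary illustration values.
    """
    lowered = text.lower()
    present = [g for g in RELATED if g in lowered]
    if len(present) < min_good:
        # Too few good phrases to judge; thin pages fall through here.
        return False
    average = sum(related_coverage(text, g) for g in present) / len(present)
    return average < min_coverage


article = ("Blue widget guide: widget size, widget price and widget installation. "
           "For widget repair, check the widget warranty and order a replacement part.")
stuffed = "blue widget blue widget widget repair cheap blue widget"
thin_product_page = "Blue widget, 12 mm, in stock."

print(looks_suspicious(article))
print(looks_suspicious(stuffed))
print(looks_suspicious(thin_product_page))
```

Note that in this sketch a thin product page with a single good phrase is never flagged, because the check only kicks in once several good phrases appear together. That would be consistent with thin pages still doing fine, but it is a property of my toy thresholds, not evidence about the real filter.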