Page is a not externally linkable
annej - 4:15 am on Feb 13, 2007 (gmt 0)
It just occurred to me that I have been assuming it has to do with density but now I'm wondering if it is talking about an absolute occurrences of the phrases. Since we don't know what the phrases are would it help to break articles into two pages in hopes of not going over the limit of related phrases? I don't know if that even makes sense. I'm grasping at straws here.
The "Detecting spam documents in a phrase based information retrieval system" patent application talks about "comparing the actual number of related phrases in a document with the expected number of related phrases"