Page is a not externally linkable
Brett_Tabke - 8:14 pm on Dec 13, 2003 (gmt 0)
>prior knowledge Had none other than their seemingly bizarre out-of-the-blue purchase of Oingo/applied semantics. Semantic analysis is much much more complex than simple sentence structure. Think of it more like a grammar checker. > The big brown dog ate the little brown dogs lunch. But it can not determine which of the following is spam: > The quick brown phentermine fed the doctor and then jumped heels over head to the the xenical box and ran into the viagra wall. > The quick brown fox, jumped over the lazy dog. However, semantic analysis could flag the first as spam on-the-fly and self learn. grab a copy of Outlook 2003 and turn the spam filter up to full. It not only grabs existing spam, it "learns" what is spam in the future. Incredible hit rates. Why does 80% of it explain Florida? Because most of the utterly bizarre searches we've seen since Florida, are in sectors that have very little text on the page. Most are in keyword spaces where generated pages are the norm (travel, hotel, weather, drugs, shopping/commercial product cats, shopping cart driven purchase pages, and ultimately - every ones index pages). There just isn't enough text on those pages to make heads-nor-tails of. It explains the "dictionary" phenom and the "over optimization" phenom. I don't think it explains everything and I do agree there must have been some specific filters that nailed people in certain categories. > has google said. yes, there is that one interview with a Google rep stating their feedback from the public was positive. They also said they were out to increase the quality of the engine. If they felt there were a problem with the update, it would have been rolled back like they did those other two times. I don't think anyone has, or will have the whole story for a bit longer, but some of the fog is starting to clear. In the meantime, build content - text rich valid (nonkeyword stuffed) content.
First, lets try to stay within the scope of the thread here. I know it is a broad topic, but these threads have tended to be free-for-alls, and we are doing what we can to keep them on topic - and hence, usable.
A bayesian spam filter can spot this as spam: