Forum Moderators: open
Alta is giving them a lot of impressions by doing this, so I figure they are making some significant contribution and I started poking around a bit. Teragram is a Boston, MA company specializing in computational linguistics, and one of the favored varieties of computational linguistics today called "optimality theory".
Now I don't know what effect Teragram's work has on the AltaVista search results, but I do know that optimality theory could account for some odd things. It could weight one search term in a query as being more important or meaningful than another. So, searching on A +B might be highly weighted toward A, whereas searching on A +C might be highly weighted toward C.
I realize my post here might be about nothing at all, but when I notice that much favor being given at a high corporate level, I get curious. I did manage to discover that Teragram helped AV develop their "Discovery" product in 1998, so they've been partnering for a while.
DH and Teragram do seem to be interested in the same questions: "What did that search query really mean? What answers do the searchers seem to like?" DH goes about it by tracking actual search records and Teragram works to solve it upfront by pulling language apart. The two approaches would be a good fit.
Teragram site says they do context-sensitive spelling programs, stemming software, and HTML parsing for text extraction at a rate of 50,000 words per second (whew!). I can see where Alta might have some use for this stuff.
If searches are moved into the realm of meaning, rather than just matching character strings, the search engine becomes a very different critter. SEO becomes much more like writing clear and direct copy and much less like playing "Guess the Algorithm".
I think there's a very good chance that this is evolution is well underway, and that this is what makes search results seem so inscrutible at times.
This is already in affect on AV. IDF (inverse document frequency) Gives words found on fewer documents in the AV database more pull than common words. This is why I reccomend optimizing for words in your search phrase seperately.
Maybe they spell check your page.
:)
Also, check out Teragrams [teragram.com] product offering. It is a what's-what of linguistic software.