Mike Grehan has a very informative interview with Paul Gardi, SVP Search at Ask Jeeves/Teoma online:
[e-marketing-news.co.uk...]
At the end of the interview there's a link to a free PDF with more information about HITS and other linkage-based algorithms.
[edited by: tedster at 5:12 pm (utc) on Dec. 30, 2005]
I also always assumed that there was a pinch of HILLTOP [cs.toronto.edu] thrown in to limit manipulation by affiliated websites, but I can't find confirmation anywhere. Teoma's "topic distillation" seems to be more of an alternative to the Hilltop approach.
Beyond that, as I said earlier, the big deal for Teoma was creating a way to retrieve and cluster the results with a runtime measured in seconds rather than minutes, but that is more an operational achievement than an algorithmic one. I don't think anything like the exact source code is publicly available.
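For anyone who just wants a feel for the linkage idea, here is a minimal sketch of the generic, published HITS hub/authority iteration in Python. To be clear, this is not Teoma's code and says nothing about how they cluster results or get their speed. The function name `hits`, the iteration count, and the tiny example graph are my own assumptions, purely for illustration.

```python
from math import sqrt


def hits(links, iterations=50):
    """Generic HITS sketch (illustrative only, NOT Teoma's implementation).

    links: dict mapping a page to the list of pages it links to.
    Returns (hubs, auths), two dicts of scores normalized to unit length.
    """
    # Collect every page that appears as a source or a target.
    pages = set(links)
    for targets in links.values():
        pages.update(targets)

    hubs = {p: 1.0 for p in pages}
    auths = {p: 1.0 for p in pages}

    for _ in range(iterations):
        # Authority score: sum of the hub scores of pages linking in.
        new_auths = {p: 0.0 for p in pages}
        for src, targets in links.items():
            for dst in targets:
                new_auths[dst] += hubs[src]

        # Hub score: sum of the authority scores of pages linked to.
        new_hubs = {p: 0.0 for p in pages}
        for src, targets in links.items():
            for dst in targets:
                new_hubs[src] += new_auths[dst]

        # Normalize so the scores do not grow without bound.
        a_norm = sqrt(sum(v * v for v in new_auths.values())) or 1.0
        h_norm = sqrt(sum(v * v for v in new_hubs.values())) or 1.0
        auths = {p: v / a_norm for p, v in new_auths.items()}
        hubs = {p: v / h_norm for p, v in new_hubs.items()}

    return hubs, auths


# Hypothetical toy graph: three link pages pointing at two content sites.
example_links = {
    "hub1": ["siteA", "siteB"],
    "hub2": ["siteA", "siteB"],
    "hub3": ["siteA"],
    "siteA": [],
    "siteB": [],
}
hubs, auths = hits(example_links)
```

In the toy graph, siteA comes out as the stronger authority because more hubs point at it, and hub1 and hub2 outrank hub3 as hubs because they link to both authorities. As I understand it, a real engine would run something like this over a query-specific subset of pages rather than the whole web, which is where the runtime problem comes from.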
Another good starting point is this PDF, also from Mike Grehan:
[searchguild.com...]