Forum Moderators: bakedjake
Researchers at Xerox Corporation today unveiled FactSpotter, new document search software that goes beyond conventional "keyword" search, enabling it, in effect, to spot the one or two golden nuggets among the pebbles on the shore."Our advanced search engine goes beyond today's typical 'keyword' search or current data-mining programs, which typically end up searching only 40 percent of all the documents that are relevant because the keywords are too limiting," said Frédérique Segond, manager of parsing and semantics research at XRCE. "Xerox's tool is more accurate because it delves into documents, extracting the concepts and the relationships among them. By 'understanding' the context, it returns the right information to the searcher, and it even highlights the exact location of the answer within the document."
FactSpotter [xerox.com]
It is one (simple) thing to return Top 10 results from maybe 50-100k matches that you will have from collection of 1 mln documents, and it is completely different (very complex) problem to return same Top 10 but from a billion of qualifying pages. You can have a brilliant smart algorithm that works beautifully on 1 mln clean pages, but if you try to scale that to billions of dirty ones you will find that you need so much hardware that it will not be feasible for a long time.
FactSpotter is as alternative search engine insofar WWW is concerned as Google Desktop search is.
[edited by: Lord_Majestic at 5:09 pm (utc) on June 21, 2007]