Rosette performs linguistic analysis that helps information retrieval applications understand search queries. For example, Rosette identifies individual words for languages such as Japanese that do not use spaces between words, breaks compound words into their individual components, and identifies parts-of-speech such as verb, adjective, etc.
[edited by: Brett_Tabke at 1:05 am (utc) on Nov. 26, 2004] [edit reason] added quote [/edit]
I heard from Carl Hoffman the CEO of Basis Technology that:
Google uses our software for Japanese and Korean, in addition to Simplified Chinese and Traditional Chinese, for segmentation, morphological analysis, and stemming. Google also uses our "Core Library for Unicode" to handle transcoding between legacy encodings and Unicode.