Page is a not externally linkable
phranque - 7:33 am on Jul 13, 2010 (gmt 0)
you might consider using Lucene.
How To Index non-English Languages using Lucene:
http://wiki.apache.org/lucene-java/IndexingOtherLanguages [wiki.apache.org]
sorry i don't have more to offer.
i was peripherally involved a few years ago with a multilanguage site that used KinoSearch, which is based on Lucene, and i believe that was due to the capability to handle chinese language search.