Page is a not externally linkable
- Search Engines
-- Asia and Pacific Region
---- Searching in asian languages


Spearmaster - 6:51 pm on Nov 1, 2002 (gmt 0)


Like I mentioned - it's not the spacing.

All are double-byte. However, two keystrokes in Japanese, Chinese or Korean (or CJK) equals a word.

Two or more keystrokes in Thai equals a letter, and NOT a word. Otherwise we could simplify the input system to CJKT instead of CJK :) Vowels in Thai are modifiers of consonants, somewhat similar to French accents. So a single "letter" could conceivably take four keystrokes.

Simply put, both rely on patterns for searching. But it will be actually more difficult to match a Thai word - or part thereof - as opposed to a CJK two-byte word - simply because there are many more keystrokes involved.

Thai is like English without the spacing - a string of letters. CJK is a string of words.

CJK is wordwordword. Thai is abcdefgabcdefg etc.


Thread source:: http://www.webmasterworld.com/asia_pacific_search_engines/284.htm
Brought to you by WebmasterWorld: http://www.webmasterworld.com