--- Comment #3 from Peter S. Housel <hou...@acm.org> 2009-08-31 10:19:51 PDT ---
(In reply to comment #2)
> The Unicode UAX#14 indicates that proper line breaking for the Thai language
> involves morphological analysis in order to determine word boundaries. The
> standard considered this as too complex and left it to the "higher levels
> of processing".
> The libthai project (http://linux.thai.net/projects/libthai) produces open
> software for this purpose, written in C/C++, which is used by Mozilla, Gnome
> applications and other OSS. Apparently, Java applications aren't as easily
> supported, yet.
The com.ibm.icu.text.ThaiBreakIterator class in recent versions of ICU4J can
supposedly do this. It makes use of an included dictionary of Thai words in
order to locate valid break points.
Configure bugmail: https://issues.apache.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.