https://bugs.documentfoundation.org/show_bug.cgi?id=95159
Samphan Raruenrom <[email protected]> changed:
What |Removed |Added
----------------------------------------------------------------------------
Severity|minor |trivial
--- Comment #8 from Samphan Raruenrom <[email protected]> ---
LibreOffice relies on ICU to break Thai words. ICU uses a greedy,
dictionary-based longest-matching word-segmentation algorithm for Thai, i.e. it
stops at the first possible segmentation it finds, here "การก|ระ|ทำ" (การก
happens to be a valid Thai word), instead of the intended "การ|กระทำ".
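
To illustrate the failure, here is a minimal Python sketch of greedy longest
matching. It is not ICU's actual code; the function name and the five-word toy
dictionary are assumptions made up for this example.

```python
# Toy dictionary (an assumption for illustration, not ICU's Thai dictionary).
TOY_DICT = {"การ", "การก", "กระทำ", "ระ", "ทำ"}


def greedy_longest_match(text, dictionary):
    """Segment text by always taking the longest dictionary word at the
    current position, never reconsidering an earlier choice."""
    max_len = max(len(word) for word in dictionary)
    words = []
    i = 0
    while i < len(text):
        # Try the longest candidate first, then shorter ones.
        for end in range(min(len(text), i + max_len), i, -1):
            if text[i:end] in dictionary:
                words.append(text[i:end])
                i = end
                break
        else:
            # No dictionary word starts here: emit one character and move on.
            words.append(text[i])
            i += 1
    return words


print("|".join(greedy_longest_match("การกระทำ", TOY_DICT)))
# prints: การก|ระ|ทำ
```

Because การก is longer than การ, the greedy step commits to it, and the rest of
the string can then only be split as ระ|ทำ.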
To fix this issue, one would need to implement a slightly better
maximal-matching algorithm in ICU. Even then, it would still fail in some
other, rarer cases.
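
As a rough sketch of what maximal matching could look like, assuming it means
"pick the segmentation with the fewest dictionary words", here is a small
dynamic-programming version in Python. The function name and the toy dictionary
are hypothetical, and this is not a patch for ICU.

```python
# Toy dictionary (an assumption for illustration, not ICU's Thai dictionary).
TOY_DICT = {"การ", "การก", "กระทำ", "ระ", "ทำ"}


def maximal_match(text, dictionary):
    """Return the segmentation of text into the fewest dictionary words,
    or None if the text cannot be fully covered by dictionary words."""
    n = len(text)
    max_len = max(len(word) for word in dictionary)
    inf = float("inf")
    best = [inf] * (n + 1)   # best[i]: fewest words needed to cover text[:i]
    back = [None] * (n + 1)  # back[i]: start index of the last word in that cover
    best[0] = 0
    for i in range(n):
        if best[i] == inf:
            continue  # position i is not reachable with dictionary words
        for end in range(i + 1, min(n, i + max_len) + 1):
            if text[i:end] in dictionary and best[i] + 1 < best[end]:
                best[end] = best[i] + 1
                back[end] = i
    if best[n] == inf:
        return None
    # Walk the back-pointers from the end to recover the words.
    words = []
    j = n
    while j > 0:
        i = back[j]
        words.append(text[i:j])
        j = i
    return words[::-1]


print("|".join(maximal_match("การกระทำ", TOY_DICT)))
# prints: การ|กระทำ
```

With this toy dictionary the two-word split wins over the three-word one. Ties
between segmentations with the same word count are broken arbitrarily here,
which is one way such an approach can still get rarer cases wrong.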
BTW, this kind of problem happens very rarely in real-world Thai, because the
ambiguous sequences are rare and must fall at the end of a line for the wrong
break to show. I have never heard a Thai speaker complain about this.
--
You are receiving this mail because:
You are the assignee for the bug.