> Line breaking for Unicode is different from the well-known "look for
> spaces" technique, which works only in European languages. In CJK
> languages, a break can occur between any adjacent ideographic
> characters.
>
> It implements line breaking for UTF-8 strings and, through iconv,
> also for strings in any iconv supported encoding. It will be put
> under LGPL.
I haven't had time yet to look at your implementation, but I wonder
whether it can handle e.g. special CJK interpunctuation characters or
Japanese hiragana which must not occur at the beginning or end of a
line.
Werner
-
Linux-UTF8: i18n of Linux on all levels
Archive: http://mail.nl.linux.org/lists/