Re: line breaking implementation available

Werner LEMBERG Thu, 01 Mar 2001 06:11:12 -0800

> Line breaking for Unicode is different from the well-known "look for
> spaces" technique, which works only in European languages. In CJK
> languages, a break can occur between any adjacent ideographic
> characters.
>
> It implements line breaking for UTF-8 strings and, through iconv,
> also for strings in any iconv supported encoding. It will be put
> under LGPL.

I haven't had time yet to look at your implementation, but I wonder
whether it can handle, for example, special CJK punctuation characters
or Japanese hiragana that must not occur at the beginning or end of a
line.
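
To make the concern concrete, here is a minimal sketch in Python of
the kind of rule I mean. It is not taken from the implementation under
discussion. The function names, the code point ranges, and the two
character sets are assumptions chosen for illustration, and the sets
are deliberately incomplete.

```python
# Hypothetical, incomplete sets for illustration only.
# Characters that should not begin a line: closing punctuation,
# small hiragana, and the prolonged sound mark.
NO_LINE_START = set("、。，．）」』】ぁぃぅぇぉっゃゅょー")
# Characters that should not end a line: opening punctuation.
NO_LINE_END = set("（「『【")


def is_ideographic(ch):
    """Rough test covering CJK punctuation, kana, unified ideographs
    and fullwidth forms. The ranges are an approximation."""
    cp = ord(ch)
    return (
        0x3000 <= cp <= 0x30FF
        or 0x4E00 <= cp <= 0x9FFF
        or 0xFF00 <= cp <= 0xFFEF
    )


def break_opportunities(text):
    """Return every index i such that a break is allowed before text[i]."""
    result = []
    for i in range(1, len(text)):
        prev, cur = text[i - 1], text[i]
        # Prohibition rules take priority over everything else.
        if cur in NO_LINE_START or prev in NO_LINE_END:
            continue
        if prev == " " and cur != " ":
            # The classic "look for spaces" rule.
            result.append(i)
        elif is_ideographic(prev) and is_ideographic(cur):
            # Between two adjacent ideographic characters.
            result.append(i)
    return result
```

With these sets, break_opportunities("日本語。「漢字」だった") allows
breaks before indices 1, 2, 4, 6, 8 and 10, but not before the full
stop, the closing bracket or the small "っ", and not after the opening
bracket.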


    Werner
-
Linux-UTF8:   i18n of Linux on all levels
Archive:      http://mail.nl.linux.org/lists/
