Markus,
I think your wcwidth implementation
(http://www.cl.cam.ac.uk/~mgk25/ucs/wcwidth.c) and mine (in libutf8)
should be changed as follows:
* Make the LINE SEPARATOR and PARAGRAPH SEPARATOR (categories Zl and
Zp) non-printable, i.e. wcwidth returns -1 for them. These two
separators are modern forms of U+000A and U+000C and should be
handled like them.
* Make the characters of category Cf have width 0. The Unicode 3.0
book, in the section about U+200E, U+200F, U+202A..U+202E, talks
about "the other zero-width characters", implying that they are
zero-width although they are not listed as Non-Spacing in PropList.txt.
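
Roughly, the two changes could be sketched as below. This is only an
illustration, not a patch against either implementation: base_width()
is a hypothetical stand-in for the existing table lookup (here it only
knows about control characters), proposed_wcwidth() is a made-up name,
and the Cf test covers only the code points named above, not the whole
category.

```c
#include <wchar.h>

/* Hypothetical stand-in for the existing width lookup.
   Assumption: a real implementation consults its tables here. */
static int base_width(wchar_t ucs)
{
    if (ucs == 0)
        return 0;
    /* C0 and C1 control characters are non-printable. */
    if (ucs < 0x20 || (ucs >= 0x7f && ucs < 0xa0))
        return -1;
    return 1;
}

/* Partial Cf test: only the characters mentioned in this mail
   (U+200E, U+200F, U+202A..U+202E). A full version would be
   generated from the Unicode data files. */
static int is_format_char(wchar_t ucs)
{
    return ucs == 0x200E || ucs == 0x200F
        || (ucs >= 0x202A && ucs <= 0x202E);
}

/* Width lookup with the two proposed changes applied first. */
int proposed_wcwidth(wchar_t ucs)
{
    /* Zl and Zp: LINE SEPARATOR and PARAGRAPH SEPARATOR
       become non-printable. */
    if (ucs == 0x2028 || ucs == 0x2029)
        return -1;

    /* Cf: format characters get width 0. */
    if (is_format_char(ucs))
        return 0;

    return base_width(ucs);
}
```

With this, proposed_wcwidth(0x2028) gives -1 and
proposed_wcwidth(0x200E) gives 0, while everything else falls through
to the existing lookup.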
What do you think?
Bruno
-
Linux-UTF8: i18n of Linux on all levels
Archive: http://mail.nl.linux.org/lists/