Markus,

I think your wcwidth implementation
(http://www.cl.cam.ac.uk/~mgk25/ucs/wcwidth.c) and mine (in libutf8)
should be changed as follows:

  * Make the LINE SEPARATOR and PARAGRAPH SEPARATOR (categories Zl and
    Zp) non-printable, i.e. wcwidth returns -1 for them. These two
    separators are modern forms of U+000A and U+000C and should be
    handled like them.

  * Make the characters of category Cf have width 0. The Unicode 3.0
    book, in the section about U+200E, U+200F, U+202A..U+202E, talks
    about "the other zero-width characters", implying that they are
    zero-width anthough they are not listed as Non-Spacing in PropList.txt.

What do you think?

                         Bruno
-
Linux-UTF8:   i18n of Linux on all levels
Archive:      http://mail.nl.linux.org/lists/

Reply via email to