Re: EastAsianWidth revised

Markus Kuhn Sat, 08 Sep 2001 12:27:01 -0700
On Sun, 9 Sep 2001, Tomohiro KUBOTA wrote:
> As you know, Unicode 3.1.1 is released.  It revised the East
> Asian Width for 15 characters.

The following 15 characters went from neutral to ambiguous,
probabaly someone discovered them in some CJK character set
that is displayed there double-width:

00AE;A # REGISTERED SIGN
014B;A # LATIN SMALL LETTER ENG
02C4;A # MODIFIER LETTER UP ARROWHEAD
02DF;A # MODIFIER LETTER CROSS ACCENT
2022;A # BULLET
2024;A # ONE DOT LEADER
203E;A # OVERLINE
2116;A # NUMERO SIGN
2153;A # VULGAR FRACTION ONE THIRD
215C;A # VULGAR FRACTION THREE EIGHTHS
215D;A # VULGAR FRACTION FIVE EIGHTHS
21B8;A # NORTH WEST ARROW TO LONG BAR
21B9;A # LEFTWARDS ARROW TO BAR OVER RIGHTWARDS ARROW TO BAR
21E7;A # UPWARDS WHITE ARROW
273D;A # HEAVY TEARDROP-SPOKED ASTERISK

> Markus, could you please update your wcwidth() implementation?

The normal wcwidth() did not change as a result of Unicode 3.1.1,
because  both neutral and ambiguous characters result there in
the same width: 1

I just updated the still somewhat experimental wcwidth_cjk(),
in case people found that so far actually useful. It contains
a new table of EastAsianWidth Ambiguous characters.

http://www.cl.cam.ac.uk/~mgk25/ucs/wcwidth.c

Enjoy ...

Markus

-- 
Markus G. Kuhn, Computer Laboratory, University of Cambridge, UK
Email: mkuhn at acm.org,  WWW: <http://www.cl.cam.ac.uk/~mgk25/>

-
Linux-UTF8:   i18n of Linux on all levels
Archive:      http://mail.nl.linux.org/linux-utf8/
Re: EastAsianWidth revised

Reply via email to