> -----Original Message-----
> From: Robert Brady [mailto:[EMAIL PROTECTED]]
...
> > Maybe not those, but it is my understanding that the "Ambiguous"
> > width value (in the Unicode EA Width data) indicates that the
> > character is "Wide" in some non-Unicode EA encodings, and
> > "Narrow" in some other, usually not EA, encoding.  So you need
> > to know the "context" (here: originally used, non-Unicode,
> > character encoding) to figure out the "width".
> 
> Exactly. This why said UTR is totally useless for terminals.
> 
> We can't send context information along with text files or telnet/ssh
> sessions - we just have to define a context that is acceptable for as
> many people as possible and just use it everywhere.
> 
> This won't support silly CJK contexts with doublewidth 
> cyrillic letters, but does anyone care?

I don't know if anyone *really* cares, but they are there, and
they are "doublewidth".  But: 1) UAX 11 does care about this for
compatibility reasons; and 2) this kind of "width" data really
belongs *in* (or associated with) the font.  So by choosing an
appropiate font list (in your terminal emulator preferences, or
whatever) one could provide the "context" needed, even for PUA
characters.  An FDCC-set is most certainly the wrong place (unless
you also consider putting fonts there...).

                Kind regards
                /kent k
-
Linux-UTF8:   i18n of Linux on all levels
Archive:      http://mail.nl.linux.org/lists/

Reply via email to