> It is important to understand that any ISO-2022-?? encoded CJK
> ideograph can be converted into a tuple consisting of a UCS code and
> one letter out of G,T,J,K,V *without* any loss of information.

Not yet, AFAIK.  The higher CNS 11643 planes are not in Unicode so far
(but they will come soon in the astral planes).

> So for guaranteed ISO 2022 -> UCS -> ISO 2022 round-trip
> compatibility of CJK ideographs, source standard country tagging of
> UCS plaintext might seem to be the cleanest approach at first.

BTW, this is the envisioned route for Unicode support in Emacs: text
attributes store the original character set of each character, in case
the user wants to convert the text back to ISO 2022.
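
To make the tagging idea concrete, here is a rough sketch of the
(UCS code, source letter) tuple from the quoted text. It is only an
illustration, not how Emacs does or will do it. The names TAG_TO_CODEC,
to_tagged and from_tagged are made up for this example, and Python's
EUC and Big5 codecs are used as stand-ins for the national character
sets that ISO 2022 would designate (Big5 is not CNS 11643, and there is
no entry for V because Python ships no matching codec).

```python
# Hypothetical table: source tag letter -> Python codec used as a stand-in
# for the national character set (an assumption of this sketch).
TAG_TO_CODEC = {
    "G": "gb2312",   # PRC
    "T": "big5",     # Taiwan (stand-in for CNS 11643)
    "J": "euc_jp",   # Japan
    "K": "euc_kr",   # Korea
}

def to_tagged(raw, tag):
    """Decode bytes from the charset named by tag into (UCS code, tag) pairs."""
    text = raw.decode(TAG_TO_CODEC[tag])
    return [(ord(ch), tag) for ch in text]

def from_tagged(pairs):
    """Re-encode each character with the charset recorded in its tag."""
    out = bytearray()
    for code, tag in pairs:
        out += chr(code).encode(TAG_TO_CODEC[tag])
    return bytes(out)

if __name__ == "__main__":
    # U+6F22 U+5B57 taken from a Japanese source, U+5B57 from a PRC source.
    jp = "\u6f22\u5b57".encode("euc_jp")
    cn = "\u5b57".encode("gb2312")
    tagged = to_tagged(jp, "J") + to_tagged(cn, "G")
    print(tagged)
    # The unified code point U+5B57 appears twice, but the tags differ,
    # so each occurrence goes back to its own source encoding.
    assert from_tagged(tagged) == jp + cn
```

The point of the tag is visible in the last assertion: without it, the
two occurrences of U+5B57 would be indistinguishable after conversion
to UCS.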


    Werner
-
Linux-UTF8:   i18n of Linux on all levels
Archive:      http://mail.nl.linux.org/lists/
