Thu, 15 Feb 2001 22:08:18 +0100, Bram Moolenaar <[EMAIL PROTECTED]> pisze:
> If you want to use 8 bit encodings (like latin1), 0x9b is a valid character,
> and you are forced to use ESC sequences...
ISO 8-bit encodings like Latin-1 are split up into C0/G0/C1/G1 regions.
Check out ECMA-43 "8-bit Coded Character Set Structure and Rules" on
http://www.ecma.ch/ecma1/STAND/ECMA-043.HTM
for details. The problem is more with IBM/Microsoft code pages, which do
not follow ECMA-43 = ISO 4873. Their characters can really only be
accessed via UTF-8 safely in the context of a terminal emulator.
Markus
--
Markus G. Kuhn, Computer Laboratory, University of Cambridge, UK
Email: mkuhn at acm.org, WWW: <http://www.cl.cam.ac.uk/~mgk25/>
-
Linux-UTF8: i18n of Linux on all levels
Archive: http://mail.nl.linux.org/lists/