By the way, even though on my planets here, all coded text in files that
we edit is exclusively in UTF-8, this does *not* mean that all the files
I edit are free of malformed UTF-8 sequences.

There are many real-world file formats (e.g., PBM) that start with a
plaintext header and end in binary data data. Here on my planet, the
plaintext headers are of course in UTF-8, but to edit it, I also have to
load the binary data at the end of the file into the editor buffer, and
this should not affect the integrity of the file or reduce my
convenience in editing the UTF-8 headers. I preferably want the editor
to show bytes of malformed sequences in some hex notation and preserve
them exactly when I save the file again. [The UTF-8 mode of Emacs 21.2
is not yet able to do this, which is why Emacs is not yet a popular
editor on my planet.]

Markus

-- 
Markus G. Kuhn, Computer Laboratory, University of Cambridge, UK
Email: mkuhn at acm.org,  WWW: <http://www.cl.cam.ac.uk/~mgk25/>

--
Linux-UTF8:   i18n of Linux on all levels
Archive:      http://mail.nl.linux.org/linux-utf8/

Reply via email to