By the way, even though on my planets here, all coded text in files that we edit is exclusively in UTF-8, this does *not* mean that all the files I edit are free of malformed UTF-8 sequences.
There are many real-world file formats (e.g., PBM) that start with a plaintext header and end in binary data data. Here on my planet, the plaintext headers are of course in UTF-8, but to edit it, I also have to load the binary data at the end of the file into the editor buffer, and this should not affect the integrity of the file or reduce my convenience in editing the UTF-8 headers. I preferably want the editor to show bytes of malformed sequences in some hex notation and preserve them exactly when I save the file again. [The UTF-8 mode of Emacs 21.2 is not yet able to do this, which is why Emacs is not yet a popular editor on my planet.] Markus -- Markus G. Kuhn, Computer Laboratory, University of Cambridge, UK Email: mkuhn at acm.org, WWW: <http://www.cl.cam.ac.uk/~mgk25/> -- Linux-UTF8: i18n of Linux on all levels Archive: http://mail.nl.linux.org/linux-utf8/
