Re: Displaying malformed UTF-8 sequences in an editor

Markus Kuhn Mon, 24 Jul 2000 03:51:26 -0700

Bram Moolenaar wrote on 2000-07-24 10:34 UTC:
> Try out the new Vim version 6.0c.  It keeps malformed sequences.  Displaying
> them isn't working well though.  I could use some suggestions on how to do
> that.  Perhaps it's best to display each malformed byte with a special
> character?

I would expect an editor to treat bytes of malformed UTF-8 sequences
just like an ASCII editor treats upper half ISO 8859 characters. One
very common convention is to represent them as a backslash followed by
three octal digits as in \377. "Less" writes <9C> in inverse, which is
probably nicer. Something like that (hex is typically far more useful
than octal).

Markus

-- 
Markus G. Kuhn, Computer Laboratory, University of Cambridge, UK
Email: mkuhn at acm.org,  WWW: <http://www.cl.cam.ac.uk/~mgk25/>

-
Linux-UTF8:   i18n of Linux on all levels
Archive:      http://mail.nl.linux.org/lists/

Re: Displaying malformed UTF-8 sequences in an editor

Reply via email to