Bram Moolenaar wrote on 2000-07-24 10:34 UTC:
> Try out the new Vim version 6.0c. It keeps malformed sequences. Displaying
> them isn't working well though. I could use some suggestions on how to do
> that. Perhaps it's best to display each malformed byte with a special
> character?

I would expect an editor to treat bytes of malformed UTF-8 sequences
just like an ASCII editor treats upper half ISO 8859 characters. One
very common convention is to represent them as a backslash followed by
three octal digits as in \377. "Less" writes <9C> in inverse, which is
probably nicer. Something like that (hex is typically far more useful
than octal).

Markus

--
Markus G. Kuhn, Computer Laboratory, University of Cambridge, UK
Email: mkuhn at acm.org, WWW: <http://www.cl.cam.ac.uk/~mgk25/>

-
Linux-UTF8: i18n of Linux on all levels
Archive: http://mail.nl.linux.org/lists/
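The two display conventions mentioned in the message above (a backslash with three octal digits, or the byte in hex between angle brackets as "less" does) can be sketched roughly as follows. This is only an illustration, not how Vim or less implement it: the function name `render_bytes` and its `style` parameter are made up for this example, inverse video is left out, and the sketch leans on Python's `surrogateescape` error handler to find the bytes that do not belong to a well-formed UTF-8 sequence.

```python
def render_bytes(data: bytes, style: str = "hex") -> str:
    """Decode UTF-8 and show every malformed byte as a visible escape.

    render_bytes and style are hypothetical names for this sketch.
    style="hex" gives <9C> (as less does), anything else gives \\234.
    """
    # surrogateescape maps each undecodable byte 0xNN to U+DCNN, so
    # malformed bytes survive decoding and can be recognised afterwards.
    text = data.decode("utf-8", errors="surrogateescape")
    out = []
    for ch in text:
        cp = ord(ch)
        if 0xDC80 <= cp <= 0xDCFF:
            byte = cp - 0xDC00  # recover the original byte value
            if style == "hex":
                out.append(f"<{byte:02X}>")
            else:
                out.append(f"\\{byte:03o}")
        else:
            out.append(ch)  # well-formed character, shown unchanged
    return "".join(out)


if __name__ == "__main__":
    print(render_bytes(b"caf\xc3\xa9 \x9c"))   # valid text plus one stray byte
    print(render_bytes(b"\xff", style="octal"))
```

Each byte of a malformed sequence is escaped on its own, so a truncated multi-byte sequence shows up as several escapes rather than as a single substitute character.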