Juanma Barranquero <[EMAIL PROTECTED]> writes:

> On 9/28/05, LENNART BORGMAN <[EMAIL PROTECTED]> wrote:
>
>> I have run into a problem with swedish national characters in an
>> XHTML document. The header of the document is like this:
>>
>> <?xml version="1.0" encoding="utf-8"?>
>> <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN"
>> "http://www.w3.org/TR/REC-html40/loose.dtd">
>> <html xmlns="http://www.w3.org/1999/xhtml" xml:lang="en">
>>
>> The swedish character ä looks like \344 in CVS Emacs (2005-09-23).
>
> Hmm. An XHTML document with encoding="utf-8" should not have
> "swedish national characters" in it, should it? Upon reading the
> file, Emacs will set its coding system to mule-utf-8, so it's no
> surprise than high-bit, non-valid utf8 byte sequences appear as
> \xxx...

I might be wrong here, but doesn't UTF-8 encode all characters in Latin-1 (ISO 8859-1) exactly as they are *in* Latin-1 encoding?
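
One way to check this outside Emacs is a small Python sketch (my own illustration, not anything taken from Emacs or from the file in question). It assumes the character involved is ä (U+00E4) and compares the bytes the two encodings produce. As far as I can tell, only the ASCII range is byte-identical; above that, UTF-8 uses two bytes where Latin-1 uses one, which would fit the \344 display if the file really holds a lone Latin-1 byte.

```python
# Compare how the character U+00E4 (a with diaeresis) is stored
# in Latin-1 and in UTF-8. The choice of character is an assumption
# based on the \344 shown in the original report.
ch = "\u00e4"

latin1_bytes = ch.encode("latin-1")  # one byte: 0xE4, octal 344
utf8_bytes = ch.encode("utf-8")      # two bytes: 0xC3 0xA4

print("Latin-1:", latin1_bytes)
print("UTF-8:  ", utf8_bytes)

# Plain ASCII characters are byte-identical in both encodings.
ascii_same = "a".encode("latin-1") == "a".encode("utf-8")

# A lone 0xE4 byte is not a complete UTF-8 sequence, so a strict
# UTF-8 decoder rejects it instead of producing U+00E4.
try:
    latin1_bytes.decode("utf-8")
    lone_byte_is_valid_utf8 = True
except UnicodeDecodeError:
    lone_byte_is_valid_utf8 = False

print("ASCII identical:", ascii_same)
print("Lone 0xE4 valid UTF-8:", lone_byte_is_valid_utf8)
```

If that is right, the two encodings share code point numbers for the Latin-1 range but not the byte representation.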