Juanma Barranquero <[EMAIL PROTECTED]> writes:

> On 9/28/05, LENNART BORGMAN <[EMAIL PROTECTED]> wrote:
>
>> I have run into a problem with swedish national characters in an
>> XHTML document. The header of the document is like this:
>>
>>   <?xml version="1.0" encoding="utf-8"?>
>>   <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN"
>>             "http://www.w3.org/TR/REC-html40/loose.dtd";>
>>   <html xmlns="http://www.w3.org/1999/xhtml"; xml:lang="en">
>>
>> The swedish character รค looks like \344 in CVS Emacs (2005-09-23).
>
> Hmm. An XHTML document with encoding="utf-8" should not have
> "swedish national characters" in it, should it? Upon reading the
> file, Emacs will set its coding system to mule-utf-8, so it's no
> surprise than high-bit, non-valid utf8 byte sequences appear as
> \xxx...

I might be wrong here, but doesn't UTF-8 encode all characters in
Latin-1 (ISO 8859-1) exactly as they are *in* Latin-1 encoding?



_______________________________________________
Emacs-devel mailing list
Emacs-devel@gnu.org
http://lists.gnu.org/mailman/listinfo/emacs-devel

Reply via email to