On 5/25/06, Bjoern Hoehrmann <[EMAIL PROTECTED]> wrote:
* Aron Stansvik wrote:
>On 5/25/06, [EMAIL PROTECTED] <[EMAIL PROTECTED]> wrote:
>> <?xml version="1.0" encoding="ISO-8859-1"?>
>> <rss version="2.0">
>>    <channel>
>>       <title>Aftonbladet &#246;jesliv</title>
>>    </channel>
>> </rss>
>>
>> I try to extract the title element from the above. But the encoding is not
>> recognised. What i get is this:
>> Aftonbladet öjesliv
>
>What do you mean the encoding is not recognized? That looks like a
>perfectly valid result. &#246; is U+00F6 LATIN SMALL LETTER O WITH
>DIAERESIS.

This appears to be a defect in your mail user agent, the message you
reponded to was ISO-8859-1 encoded and had the o-umlaut encoded as two
octets (C3 B6, which is the proper UTF-8 sequence). The original problem
appears to the the usual "API gives UTF-8 but I expect something else".

Ah. Right. Using Gmail so it showed it just fine.

Aron
_______________________________________________
xml mailing list, project page  http://xmlsoft.org/
[email protected]
http://mail.gnome.org/mailman/listinfo/xml

Reply via email to