Hello Thomas,

2007/10/22, Thomas Petazzoni <[EMAIL PROTECTED]>:
> The string you mention is encoded in ISO-8859-1 (or ISO-8859-15) : the
> special character é is encoded on one byte only, so it's not UTF-8.

I'm not sure about that. If you look at the Unicode chart for Latin-1
(http://www.unicode.org/charts/PDF/U0080.pdf), the code point of é is
U+00E9 (p. 7).

As the string is explicitly marked as a Unicode string (u'string') in
Python, I would say that this is indeed a Unicode string, with the é
shown in hexadecimal.
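
To illustrate the difference between the code point and its byte
encodings, here is a minimal sketch. The variable names are only
illustrative and are not taken from the demexp code; it assumes a
Python version that accepts the u'' prefix.

```python
# A Unicode string holding the single character é, written with a hex escape.
s = u'\xe9'

# The code point is a number, independent of any byte encoding.
code_point = ord(s)

# The same character serialized with two different encodings.
latin1_bytes = s.encode('iso-8859-1')  # one byte
utf8_bytes = s.encode('utf-8')         # two bytes

print(hex(code_point))
print(len(latin1_bytes))
print(len(utf8_bytes))
```

So the \xe9 in the repr of a Unicode string is the code point U+00E9,
not necessarily a Latin-1 byte; the byte form only appears once the
string is encoded.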

Yours,
d.


_______________________________________________
Demexp-dev mailing list
Demexp-dev@nongnu.org
http://lists.nongnu.org/mailman/listinfo/demexp-dev
