On Mon, 20 Dec 2004 12:49:39 +0200, Miki Tebeka <[EMAIL PROTECTED]> wrote:
>Hello Joe,
>
>> Is there any library to convert HTML page with \uXXXX encoded text to
>> native character set, e.g. BIG5.
>Try: help("".decode)
>
But I think the OP wants to en-code. E.g. (ichi is Japanese for "one"; I don't
know the Chinese word ;-)
>>> ichi = u'\u4e00'
>>> ichi
u'\u4e00'
>>> ichi.encode('big5')
'\xa4@'
Unless I am mistaken, that created a two-byte str holding the big5 code for
the single-horizontal-stroke glyph whose Unicode code point is U+4E00 (u'\u4e00'):
>>> list(ichi.encode('big5'))
['\xa4', '@']
Going from a big5-encoded str back to unicode then takes de-coding:
>>> '\xa4@'.decode('big5')
u'\u4e00'
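If the HTML page really contains the six literal characters \uXXXX (rather than
already-decoded unicode), they have to be turned into characters before
encoding. A minimal sketch, with assumptions: it is written for Python 3 (where
str is unicode and encode returns bytes), unlike the Python 2 session above; the
helper name unescape_u and the sample page string are made up for illustration;
and it only handles escapes with exactly four hex digits.

```python
import re

# Matches a literal backslash, 'u', and exactly four hex digits.
_U_ESCAPE = re.compile(r'\\u([0-9a-fA-F]{4})')

def unescape_u(text):
    """Replace each literal \\uXXXX sequence with the character it names.

    Hypothetical helper, not part of any library.
    """
    return _U_ESCAPE.sub(lambda m: chr(int(m.group(1), 16)), text)

# Sample input (made up): a raw string, so the backslash is kept literally.
page = r'<p>\u4e00</p>'

# First un-escape to real characters, then en-code to big5 bytes.
big5_bytes = unescape_u(page).encode('big5')
print(big5_bytes)  # b'<p>\xa4@</p>'
```

Characters that big5 cannot represent will make encode raise
UnicodeEncodeError; passing errors='replace' or errors='xmlcharrefreplace' to
encode is one way to deal with that, depending on what the page should show.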
Regards,
Bengt Richter