Marc 'BlackJack' Rintsch:
>In <[EMAIL PROTECTED]>, willie wrote:
>> # What's the correct way to get the
>> # byte count of a unicode (UTF-8) string?
>> # I couldn't find a builtin method
>> # and the following is memory inefficient.
>> ustr = "example\xC2\x9D".decode('UTF-8')
>> num_chars = len(ustr) # 8
>> buf = ustr.encode('UTF-8')
>> num_bytes = len(buf) # 9
>That is the correct way.
# Apologies if I'm being dense, but it seems
# unusual that I'd have to make a copy of a
# unicode string, converting it into a byte
# string, before I can determine the size (in bytes)
# of the unicode string. Can someone provide the rational
# for that or correct my misunderstanding?
# Thanks.
--
http://mail.python.org/mailman/listinfo/python-list