On Fri, 1 Apr 2022 at 03:45, Dennis Lee Bieber <wlfr...@ix.netcom.com> wrote:
>
> On Thu, 31 Mar 2022 00:36:10 -0700 (PDT), moi <wxjmfa...@gmail.com>
> declaimed the following:
>
> >>>> 'äÄöÖüÜ'.encode('utf-8')
> >b'\xc3\xa4\xc3\x84\xc3\xb6\xc3\x96\xc3\xbc\xc3\x9c'
> >>>> len('äÄöÖüÜ'.encode('utf-8'))
> >12
> >>>>
> >>>> ?
>
>         Is there a question in there somewhere?
>
>         Crystal ball is hazy...
>
>         However... Note that once you encode the Unicode literal, you have a
> BYTE string. There are 12 bytes in that binary -- it is NOT considered
> Unicode at that point (only when you decode it with the same CODEC will it
> be Unicode).
>

That's jmf. Ignore him. He knows nothing about Unicode and is
determined to make everyone aware of that fact.

He got blocked from the mailing list ages ago, and I don't think
anyone's regretted it.

ChrisA
-- 
https://mail.python.org/mailman/listinfo/python-list

Reply via email to