In article <[email protected]>, Steven D'Aprano <[email protected]> wrote:
> There are various common ways to store Unicode strings in RAM. > > The first, UTF-16. > [...] > Another option is UTF-32. > [...] > Another option is to use UTF-8 internally. > [...] > In Python 3.3, CPython introduced an internal scheme that gives the best > of all worlds. When a string is created, Python uses a different > implementation depending on the characters in the string: This was an excellent post, but I would take exception to the "best of all worlds" statement. I would put it a little less absolutely and say something like, "a good compromise for many common use cases". I would even go with, "... for most common use cases". But, there are situations where it loses. -- https://mail.python.org/mailman/listinfo/python-list
