Paul Boddie wrote:
> One side-effect of the "big push" to UTF-8 amongst the Linux
> distribution vendors/maintainers is the evasion of issues such as
> filesystem encodings and "real" Unicode at the system level. In
> Python, when you have a Unicode object, you are dealing with
> idealised
> sequences of characters, whereas in many system and library APIs out
> there you either get back a sequence of anonymous bytes or a sequence
> of UTF-8 bytes that people are pretending is Unicode, right up until
> the point where someone recompiles the software to use UTF-16
> instead,
> thus causing havoc. Anyone who has needed to expose filesystems
> created by Linux distributions before the UTF-8 "big push" to later
> distributions can attest to the fact that the "see no evil" brass
> monkey is wearing a T-shirt with "UTF-8" written on it.

Unfortunately the monkey is painted in the air with a stick, so
not everyone can see it. Python can't. Given a random linux system
how can you tell if the monkey has pushed it already or not?

  Serge.

-- 
http://mail.python.org/mailman/listinfo/python-list

Reply via email to