On Monday, June 2, 2014 7:53:05 AM UTC+5:30, Tim Delaney wrote:
> On 2 June 2014 11:14, Steven D'Aprano <steve+comp....@pearwood.info> wrote:
>> Latin-1 is one of those legacy encodings which needs to die, not to be
>> entrenched as the default. My terminal uses UTF-8 by default (as it
>> should), and if I use the terminal to input "δжç", Python ought to see
>> what I input, not Latin-1 moji-bake.
> For some purposes, there needs to be a way to treat an arbitrary
> stream of bytes as an arbitrary stream of 8-bit
> characters. iso-latin-1 is a convenient way to do that. It's not the
> only way, but settling on it and being consistent is better than not
> having a way.
Here is a quote from the oracle docs:
| The C locale, also known as the POSIX locale, is the POSIX system
| default locale for all POSIX-compliant systems.
In more layman language
| ASCII also known as the 'Unix locale' is the default for all *nix
| compliant systems
which is a key aspect of what Ive called 'The UNIX Assumption' :