Re: [Python-ideas] [issue33865] [EASY] Missing code page aliases: "unknown encoding: 874"

2018-06-25 Thread Stephen J. Turnbull
Ronald Oussoren writes: > The user shouldn’t have to do anything other than install Python. IMHO > were doing something wrong when the python interpreter doesn’t start up > with a default system configuration There's no evidence in the issue that I can see that suggests that the user

Re: [Python-ideas] [issue33865] [EASY] Missing code page aliases: "unknown encoding: 874"

2018-06-22 Thread Ronald Oussoren
On 21 Jun 2018, at 09:17, Stephen J. Turnbull wrote:Ronald Oussoren writes:Possibly just for the “cp…” encodings, but IMHO only if we confirmthat the code to look for the preferred encoding returns a codepagenumber on Windows and changing that code leads to worse resultsthan adding numeric

Re: [Python-ideas] [issue33865] [EASY] Missing code page aliases: "unknown encoding: 874"

2018-06-21 Thread Stephen J. Turnbull
Ronald Oussoren writes: > Possibly just for the “cp…” encodings, but IMHO only if we confirm > that the code to look for the preferred encoding returns a codepage > number on Windows and changing that code leads to worse results > than adding numeric aliases for the “cp…” encodings. Almost

Re: [Python-ideas] [issue33865] [EASY] Missing code page aliases: "unknown encoding: 874"

2018-06-18 Thread Ronald Oussoren
> On 18 Jun 2018, at 02:34, Steven D'Aprano wrote: > >> Sure, but for at least one user Python 3.6 fails to start because >> initialising the sys.std* streams fails due to not finding a “874” >> encoding. > > That doesn't mean that the bug is best fixed by adding an alias. I agree, I’ve

Re: [Python-ideas] [issue33865] [EASY] Missing code page aliases: "unknown encoding: 874"

2018-06-18 Thread Karthikeyan
> BTW. “cp874” does exist according to the unicode consortium: https://www.unicode.org/Public/MAPPINGS/VENDORS/MICSFT/WINDOWS/CP874.TXT, and appears to be a codepage for a (the?) Thai language. The user might therefore be running Windows with a Thai locale. This page

Re: [Python-ideas] [issue33865] [EASY] Missing code page aliases: "unknown encoding: 874"

2018-06-17 Thread Steven D'Aprano
> Sure, but for at least one user Python 3.6 fails to start because > initialising the sys.std* streams fails due to not finding a “874” > encoding. That doesn't mean that the bug is best fixed by adding an alias. If the error was failing to find encoding "ltain-1", would we add an alias or

Re: [Python-ideas] [issue33865] [EASY] Missing code page aliases: "unknown encoding: 874"

2018-06-17 Thread Stephen J. Turnbull
Folks. There are standards. "1252" *is not* an alias for "windows-1252" according to the IANA, while "866" *is* an alias for "IBM866" according to the same authority. Most 3-digit "IBMxxx" ARE aliased to both "cpxxx" and just "xxx", but not all. None of "IBM874", "874", or "cp874" exists

Re: [Python-ideas] [issue33865] [EASY] Missing code page aliases: "unknown encoding: 874"

2018-06-16 Thread Steven D'Aprano
> It is easy to test it. Encoding/decoding with '874' should give the > same result as with 'cp874'. I know it is too late to remove that feature, but why do we support digit-only IDs for encodings? They can be ambiguous. If Wikipedia is correct, cp874 (also known as ibm874) and Windows-874