Re: [Python-ideas] Support WHATWG versions of legacy encodings

2018-01-21 Thread Stephen J. Turnbull
Random832 writes: > I think his point is that the WHATWG standard is the one that > governs HTML and therefore HTML that uses these encodings > (including the C1 characters) are conformant to *that* standard, I don't think that is a tenable interpretation of this standard. The WHAT-WG standard

Re: [Python-ideas] Support WHATWG versions of legacy encodings

2018-01-21 Thread Stephen J. Turnbull
I don't expect to change your mind about the "right" way to deal with this, but this is a more explicit description of what those of us who advocate error handlers are thinking about. It may be useful in writing your PEP (PEPs describe rejected counterproposals and amendments along with adopted pr

Re: [Python-ideas] Support WHATWG versions of legacy encodings

2018-01-21 Thread Guido van Rossum
On Sun, Jan 21, 2018 at 2:43 AM, Steven D'Aprano wrote: > On Fri, Jan 19, 2018 at 06:35:30PM +, Rob Speer wrote: > > Guido had some very sensible feedback just a moment ago. I am wondering > now > > if we lost Guido because I broke python-ideas etiquette (is a pull > request > > not the next

Re: [Python-ideas] Support WHATWG versions of legacy encodings

2018-01-21 Thread Chris Angelico
On Mon, Jan 22, 2018 at 3:36 AM, Rob Speer wrote: > Thanks for the recommendation there, and I'd like a little extra information > -- I don't know _mechanically_ how to write a PEP. (Where do I submit it to, > for example?) I can help you with that side of things. Start by checking out PEP 1: ht

Re: [Python-ideas] Support WHATWG versions of legacy encodings

2018-01-21 Thread Rob Speer
> The question to my mind is whether or not this "latin1replace" handler, > in conjunction with existing codecs, will do the same thing as the > WHATWG codecs. If I have understood you correctly, I think it will. Have > I missed something? It won't do the same thing, and neither will the "chaining

Re: [Python-ideas] Windows Best Fit Encodings

2018-01-21 Thread M.-A. Lemburg
On 20.01.2018 08:01, Steve Dower wrote: > On 20Jan2018 0518, M.-A. Lemburg wrote: >> do you know of a definite resource for Windows code pages >> on MSDN or another official MS website ? > > I don't know of anything sorry, and my quick search didn't turn up > anything public. But I can at least co

Re: [Python-ideas] Support WHATWG versions of legacy encodings

2018-01-21 Thread Steven D'Aprano
On Fri, Jan 19, 2018 at 06:35:30PM +, Rob Speer wrote: > > It depends on what you want to achieve. You may want to fail, assign a > code point from a private area or use a surrogate escape approach. > > And the way to express that is with errors='replace', > errors='surrogateescape', or whatev