New submission from Mingye Wang:
Mappings for 0x81 and 0x8D in multiple Windows code pages diverge from what
Windows does. Attached is a script that tests for this behavior. (These two
bytes are not necessary the only problems, but for sure they are the most
widespread and famous ones. Again, refer to Unicode best fit for something that
works.)
This problem is seen in Python 2.7.10 on Windows 10b14959, but apparently it is
known since long ago[1]. Python 3.4.3 on Cygwin also fails
``b'\x81\x8d'.encode('cp1252')``.
[1]: https://ftfy.readthedocs.io/en/latest/#module-ftfy.bad_codecs.sloppy
----------
components: Unicode
files: pycp.py
messages: 280914
nosy: Artoria2e5, ezio.melotti, haypo
priority: normal
severity: normal
status: open
title: Non-Windows mappings for a couple of Windows code pages
type: behavior
versions: Python 2.7, Python 3.3, Python 3.4, Python 3.5, Python 3.6, Python 3.7
Added file: http://bugs.python.org/file45497/pycp.py
_______________________________________
Python tracker <[email protected]>
<http://bugs.python.org/issue28712>
_______________________________________
_______________________________________________
Python-bugs-list mailing list
Unsubscribe:
https://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com