Serhiy Storchaka added the comment:

The patch in issue17909 was written to implement the encoding detection described in RFC 4627 [1], and the test uses RFC 4627-conforming data. A single codepoint "5" is not a valid JSON text under RFC 4627, but it is valid under RFC 7159 [2].
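For illustration, here is a minimal sketch of the detection scheme from RFC 4627, section 3, which inspects the pattern of null octets in the first four octets of the input. The function name `detect_rfc4627` is hypothetical and this is not the code from the issue17909 patch. Returning None for inputs shorter than four octets is an assumption of this sketch.

```python
# Sketch of RFC 4627 (section 3) encoding detection.
# `detect_rfc4627` is a hypothetical name, not the function from the patch.

# Null-octet patterns of the first four octets (True means the octet is 0x00).
_PATTERNS = {
    (True, True, True, False): "utf-32-be",    # 00 00 00 xx
    (True, False, True, False): "utf-16-be",   # 00 xx 00 xx
    (False, True, True, True): "utf-32-le",    # xx 00 00 00
    (False, True, False, True): "utf-16-le",   # xx 00 xx 00
    (False, False, False, False): "utf-8",     # xx xx xx xx
}


def detect_rfc4627(data: bytes):
    """Return the encoding suggested by the RFC 4627 table, or None.

    Assumption of this sketch: the table is applied only when at least
    four octets are available; shorter input or an unlisted pattern
    yields None.
    """
    if len(data) < 4:
        return None
    nulls = tuple(octet == 0 for octet in data[:4])
    return _PATTERNS.get(nulls)


# An RFC 4627 text (an array) in UTF-16-BE matches the table.
print(detect_rfc4627("[5]".encode("utf-16-be")))

# A bare "5" is a single octet in UTF-8, so the four-octet table
# cannot be applied to it.
print(len("5".encode("utf-8")), detect_rfc4627("5".encode("utf-8")))

# The same "5" is four octets in UTF-32-LE (no BOM).
print(len("5".encode("utf-32-le")), detect_rfc4627("5".encode("utf-32-le")))
```

The table relies on the first two characters of an RFC 4627 text being ASCII, which holds because such a text must be an object or an array. A bare scalar such as "5" does not give that guarantee.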
The comment in your patch is not accurate, since '5' encodes to 1 byte in utf-8 and to 4 bytes in utf-32*.

[1] https://tools.ietf.org/html/rfc4627
[2] https://tools.ietf.org/html/rfc7159

----------

_______________________________________
Python tracker <rep...@bugs.python.org>
<http://bugs.python.org/issue28541>
_______________________________________