Ezio Melotti <[email protected]> added the comment:
Thanks for the patch!
> * fix an error in the error handler for utf-16-le. (In, Python3.2
> b'\xdc\x80\x00\x41'.decode('utf-16-be', 'ignore') returns "\x00"
> instead of "A" for some reason)
This should probably be done on a separate patch that will be applied to
3.2/3.3 (assuming that it can go to 3.2). Rejecting surrogates will go in 3.3
only. (Note that lot of Unicode-related code changed between 3.2 and 3.3.)
> Should we really reject lone surrogates for UTF-7?
No, I meant only UTF-8/16/32; UTF-7 is fine as is.
----------
_______________________________________
Python tracker <[email protected]>
<http://bugs.python.org/issue12892>
_______________________________________
_______________________________________________
Python-bugs-list mailing list
Unsubscribe:
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com