Changes by Julian Mehnle jul...@mehnle.net:
--
nosy: +jmehnle
___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue9133
___
___
Python-bugs-list mailing
New submission from Mike Lewis mikelikes...@gmail.com:
When I do
codecs.encode(codecs.decode('\xed\xbc\xad', 'utf8'), 'utf8')
its not throwing an exception. '\xed\xbc\xad' is an invalid UTF8 byte sequence.
It maps to the value U+DF2D which is a surrogate pair it seems.
Mike Lewis mikelikes...@gmail.com added the comment:
Sorry, meant to add this part to the quote from the rfc:
This leads to different results for character
numbers above 0x; the CESU-8 encoding of those characters is NOT
valid UTF-8
--
___
Ezio Melotti ezio.melo...@gmail.com added the comment:
This is already fixed in Python 3.
However I think that for backward compatibility reasons it can't be fixed in
Python 2, where it is possible to encode and decode every codepoint to/from
UTF-8.
See also
Changes by Marc-Andre Lemburg m...@egenix.com:
--
resolution: - wont fix
status: pending - closed
___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue9133
___
Marc-Andre Lemburg m...@egenix.com added the comment:
Ezio Melotti wrote:
I think this can be closed as wontfix.
Agreed. I've already closed the ticket.
--
___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue9133
Changes by Ezio Melotti ezio.melo...@gmail.com:
--
stage: - committed/rejected
___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue9133
___
___