[issue7330] PyUnicode_FromFormat segfault

STINNER Victor Fri, 18 Feb 2011 07:09:18 -0800

STINNER Victor <[email protected]> added the comment:

> Oh, what if the truncated char* cannot be decoded correctly?
> e.g. a two-byte character is divided in the middle?


Yes, but PyUnicode_FromFormatV() uses the UTF-8 decoder with the "replace"
error handler, so the incomplete byte sequence will be replaced by � (it
doesn't fail with an error). Example:

>>> "abc€".encode("utf-8")[:-1].decode("utf-8", "replace")
'abc�'
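
For comparison, here is a small sketch of the same truncation at the Python level, showing the default "strict" handler next to "replace". It only illustrates the decoder behaviour, it is not the C code of PyUnicode_FromFormatV(); the variable names are made up for the example.

```python
# Sketch: cut a UTF-8 byte string in the middle of a multi-byte character
# and compare the "strict" and "replace" error handlers.
data = "abc\u20ac".encode("utf-8")   # EURO SIGN encodes to 3 bytes: e2 82 ac
truncated = data[:-1]                # drop the last byte of that sequence

try:
    truncated.decode("utf-8")        # the default error handler is "strict"
    strict_failed = False
except UnicodeDecodeError:
    strict_failed = True             # the incomplete sequence is rejected

# With "replace", the incomplete sequence becomes U+FFFD instead of an error.
replaced = truncated.decode("utf-8", "replace")
print(strict_failed, ascii(replaced))
```

With "replace" the caller always gets a string back, at the cost of losing the character that was cut.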

----------

_______________________________________
Python tracker <[email protected]>
<http://bugs.python.org/issue7330>
_______________________________________