Serhiy Storchaka added the comment:
What problem is purposed to solve clean_surrogate_escapes()? Could you please
provide user scenario or two?
Possible alternative implementation is:
def clean_surrogate_escapes(s):
return s.encode('utf-8', 'surrogatepass').decode('utf-8', 'replace')
It can be faster for some data (for mostly ASCII with rare surrogates it is
superfast). For other data 'utf-16' can be better choice.
----------
_______________________________________
Python tracker <[email protected]>
<http://bugs.python.org/issue18814>
_______________________________________
_______________________________________________
Python-bugs-list mailing list
Unsubscribe:
https://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com