[issue15372] Python is missing alternative for common quoting character

2012-07-16 Thread Antoine Pitrou
Changes by Antoine Pitrou : -- status: open -> closed ___ Python tracker ___ ___ Python-bugs-list mailing list Unsubscribe: http://ma

[issue15372] Python is missing alternative for common quoting character

2012-07-16 Thread Matthew Barnett
Matthew Barnett added the comment: A codepoint such as "é" ("\N{LATIN SMALL LETTER E WITH ACUTE}") can be decomposed to "\u0065\u0301" ("\N{LATIN SMALL LETTER E}\N{COMBINING ACUTE ACCENT"), but "\u201c" ("\N{LEFT DOUBLE QUOTATION MARK}") and "\u201d" ("\N{RIGHT DOUBLE QUOTATION MARK}") cannot

[issue15372] Python is missing alternative for common quoting character

2012-07-16 Thread R. David Murray
R. David Murray added the comment: I don't understand why you would expect to get a ". The unicode characters aren't "s. As far as I can see (from, for example, http://codepoints.net/U+201C), Python is behaving as expected here. -- nosy: +r.david.murray resolution: -> invalid stage

[issue15372] Python is missing alternative for common quoting character

2012-07-16 Thread Luca Fabbri
New submission from Luca Fabbri : Using the unicodedata.decomposition function on characters like \u201c and \u201d I didn't get back the classic quote character ("). This is a very common error when text is taken from Microsoft Word (where in italian language a couple of quoting character in