[ https://issues.apache.org/jira/browse/THRIFT-395?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12695128#action_12695128 ]
David Reiss commented on THRIFT-395:
------------------------------------

@Jonathan: I think you are still assuming that the "string" is a "Unicode string" or a "string of Unicode code points". Just because something is a "string" doesn't mean it is Unicode.

@Chad: Please feel free to ask questions on 413 and 414. They both have fairly simple patches posted, and shouldn't change any existing behavior.

@Bryan: Even if we decide that we want strings to be always Unicode, there are encodings other than UTF-8, and I don't see why we should prevent users from using annotations to specify an alternate encoding.

> Python library + compiler does not support unicode strings
> ----------------------------------------------------------
>
>                 Key: THRIFT-395
>                 URL: https://issues.apache.org/jira/browse/THRIFT-395
>             Project: Thrift
>          Issue Type: Improvement
>          Components: Compiler (Python), Library (Python)
>            Reporter: Jonathan Ellis
>            Assignee: Jonathan Ellis
>             Fix For: 0.1
>
>         Attachments: 0001-python-Minor-cleanup-of-protocols-don-t-use-str.patch, 0002-THRIFT-395.-python-Phase-One-of-support-for-unicode.patch, 0003-THRIFT-395.-python-Phase-Two-of-support-for-unicode.patch, 0004-python-Remove-ridiculous-semicolons-from-gen-code.patch, python-utf8-v2.patch, python-utf8.patch
>
>
> Effectively, all strings in the python bindings are treated as binary strings -- no encoding/decoding to UTF-8 is done. So if a unicode object is passed to a (regular, non-binary) string, an exception is raised.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.