[ https://issues.apache.org/jira/browse/SOLR-2034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12900432#action_12900432 ]
Yonik Seeley commented on SOLR-2034: ------------------------------------ Seems OK. I think modified UTF-8 was originally used so that the string chars could be directly written to the output stream instead of to a temp buffer. But copying to a temp buffer first shouldn't have that much overhead. JavaBinCodec.VERSION should be bumped. It is serialized and verified when decoding, and currently an exception is thrown if it does not match the current version. > javabin should use UTF-8, not modified UTF-8 > -------------------------------------------- > > Key: SOLR-2034 > URL: https://issues.apache.org/jira/browse/SOLR-2034 > Project: Solr > Issue Type: Bug > Reporter: Robert Muir > Attachments: SOLR-2034.patch, SOLR-2034.patch > > > for better interoperability, javabin should use standard UTF-8 instead of > modified UTF-8 (http://www.unicode.org/reports/tr26/) -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online. --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org