[ https://issues.apache.org/jira/browse/CASSANDRA-2379?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13010785#comment-13010785 ]

Ed Anuff commented on CASSANDRA-2379:
-------------------------------------

Actually, it may be a good idea to avoid using Charset.defaultCharset().  On
the Mac, at least, that's going to be "MacRoman", and on any platform it can
change with the setting of the system "file.encoding" property.  Shouldn't we
be making sure we're using UTF-8?
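
To illustrate the point, here is a minimal sketch of encoding with an explicit
UTF-8 charset instead of the platform default. The class name Utf8Bytes and
its two methods are hypothetical and are not the actual ByteBufferUtil code;
the sketch also assumes Java 7 or later for StandardCharsets (on older JVMs,
Charset.forName("UTF-8") would serve the same purpose).

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;

// Hypothetical helper for illustration only; not the real ByteBufferUtil.
public class Utf8Bytes {

    // Encode with an explicit charset so the result does not depend on
    // Charset.defaultCharset() or the "file.encoding" system property.
    public static ByteBuffer bytes(String s) {
        return ByteBuffer.wrap(s.getBytes(StandardCharsets.UTF_8));
    }

    // Decode with the same explicit charset. duplicate() leaves the
    // caller's buffer position and limit untouched.
    public static String string(ByteBuffer buffer) {
        return StandardCharsets.UTF_8.decode(buffer.duplicate()).toString();
    }
}
```

With this approach, the same string yields the same bytes on every platform,
whatever the JVM's default charset happens to be.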

> ByteBufferUtil#bytes(String) can produce undesired results for some characters
> ------------------------------------------------------------------------------
>
>                 Key: CASSANDRA-2379
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-2379
>             Project: Cassandra
>          Issue Type: Bug
>            Reporter: Nate McCall
>         Attachments: 2379.txt
>
>
> The difference between getBytes(java.nio.charset.Charset) and 
> getBytes("[charsetname]") on some platforms (Mac, it seems) can be 
> substantial. From the java.lang.String javadoc for the former:
> This method always replaces malformed-input and unmappable-character 
> sequences with this charset's default replacement byte array...
> vs. the latter:
> The behavior of this method when this string cannot be encoded in the default 
> charset is unspecified.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira
