[ https://issues.apache.org/jira/browse/ORC-101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15495207#comment-15495207 ]
ASF GitHub Bot commented on ORC-101: ------------------------------------ Github user omalley commented on a diff in the pull request: https://github.com/apache/orc/pull/60#discussion_r79100160 --- Diff: java/core/src/java/org/apache/orc/util/BloomFilter.java --- @@ -130,7 +125,7 @@ public void addString(String val) { if (val == null) { add(null); } else { - add(val.getBytes()); + add(val.getBytes(Charset.defaultCharset())); --- End diff -- No, this one is explicitly the default charset to be compatible. BloomFilterUtf8 is the fixed one. > Correct the use of the default charset in the bloomfilter > --------------------------------------------------------- > > Key: ORC-101 > URL: https://issues.apache.org/jira/browse/ORC-101 > Project: Orc > Issue Type: Improvement > Reporter: Owen O'Malley > Assignee: Owen O'Malley > > Currently ORC's bloom filter depends on the default character set, which > isn't constant between computers. -- This message was sent by Atlassian JIRA (v6.3.4#6332)