[hbase] Replace all String.getBytes() with String.getBytes("UTF-8") -------------------------------------------------------------------
Key: HADOOP-1821 URL: https://issues.apache.org/jira/browse/HADOOP-1821 Project: Hadoop Issue Type: Bug Components: contrib/hbase Affects Versions: 0.15.0 Reporter: Jim Kellerman Assignee: Jim Kellerman Fix For: 0.15.0 We cannot rely on the default encoding being UTF-8 so a naked String.getBytes() will return the bytes in whatever the default encoding is for the platform on which code is running. If it is subsequently read on another machine with a different default encoding, converting the bytes back to a string will result in garbage. Consequently, we should always specify an encoding for getBytes() and new String. UTF-8 is the preferred encoding. The places where we use unqualified getBytes are: HConstants.DELETE_BYTES, HConstants.COMPLETE_CACHEFLUSH hbase.io.MapWritable.main (but this will not be an issue once HADOOP-1760 is completed) TestHMemcache.addRows PerformanceEvaluation.generateValue TestGet TestHRegion TestHBaseCluster TestTableMapReduce TestScanner2 TestRegExpRowFilter TestRowFilterSet org.onelab.test.StringKey -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.