Enable compression in HBase Export ---------------------------------- Key: HBASE-2225 URL: https://issues.apache.org/jira/browse/HBASE-2225 Project: Hadoop HBase Issue Type: Improvement Components: util Affects Versions: 0.20.1 Environment: OS agnostic Reporter: Ted Yu Priority: Minor
org.apache.hadoop.hbase.mapreduce.Export should set compression codec In createSubmittableJob(), the following should be added: FileOutputFormat.setCompressOutput(job, true); FileOutputFormat.setOutputCompressorClass(job, org.apache.hadoop.io.compress.GzipCodec.class); >From my experiment, 10% to 50% reduction in Export output has been observed. SequenceFileInputFormat used by the Import tool is able to detect GzipCodec - there is no change for Import class. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.