Enable compression in HBase Export
----------------------------------
Key: HBASE-2225
URL: https://issues.apache.org/jira/browse/HBASE-2225
Project: Hadoop HBase
Issue Type: Improvement
Components: util
Affects Versions: 0.20.1
Environment: OS agnostic
Reporter: Ted Yu
Priority: Minor
org.apache.hadoop.hbase.mapreduce.Export should set compression codec
In createSubmittableJob(), the following should be added:
FileOutputFormat.setCompressOutput(job, true);
FileOutputFormat.setOutputCompressorClass(job,
org.apache.hadoop.io.compress.GzipCodec.class);
>From my experiment, 10% to 50% reduction in Export output has been observed.
SequenceFileInputFormat used by the Import tool is able to detect GzipCodec -
there is no change for Import class.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.