HBase exporter should compress output files by default (or at least allow this 
as an option)
--------------------------------------------------------------------------------------------

                 Key: HBASE-3166
                 URL: https://issues.apache.org/jira/browse/HBASE-3166
             Project: HBase
          Issue Type: Improvement
          Components: util
    Affects Versions: 0.20.6
            Reporter: Josh Rosenblum
            Priority: Minor


The HBase exporter puts (key, Result) pairs as keys and values into an output 
sequence file.

There could be significant savings at low cost if at least default compression 
was enabled on this output sequence file.

In createSubmittableJob(), this might be as simple as adding the following:

        SequenceFileOutputFormat.setOutputCompressionType(job, 
SequenceFile.CompressionType.BLOCK);
        SequenceFileOutputFormat.setCompressOutput(job, true);
        FileOutputFormat.setOutputCompressorClass(job, DefaultCodec.class);


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to