Tao Li created HADOOP-13849:
-------------------------------

             Summary: Bzip2 java-builtin and system-native have almost the same 
compress speed
                 Key: HADOOP-13849
                 URL: https://issues.apache.org/jira/browse/HADOOP-13849
             Project: Hadoop Common
          Issue Type: Bug
          Components: common
    Affects Versions: 2.6.0
         Environment: os version: redhat6
hadoop version: 2.6.0
native bzip2 version: bzip2-devel-1.0.5-7.el6_0.x86_64

            Reporter: Tao Li


I tested bzip2 java-builtin and system-native compression, and I found the 
compress speed is almost the same. (I think the system-native should have 
better compress speed than java-builtin)

My test case:
input: 2.7GB text file without compression
bzip2 java-builtin compress: 457MB, 12min 4sec
bzip2 system-native compress: 457MB, 12min 19sec

My MapReduce Config:
conf.set("mapreduce.fileoutputcommitter.marksuccessfuljobs", "false");
conf.set("mapreduce.output.fileoutputformat.compress", "true");
conf.set("mapreduce.output.fileoutputformat.compress.type", "BLOCK");
conf.set("mapreduce.output.fileoutputformat.compress.codec", 
"org.apache.hadoop.io.compress.BZip2Codec");
conf.set("io.compression.codec.bzip2.library", "java-builtin"); // for 
java-builtin
conf.set("io.compression.codec.bzip2.library", "system-native"); // for 
system-native




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: common-dev-unsubscr...@hadoop.apache.org
For additional commands, e-mail: common-dev-h...@hadoop.apache.org

Reply via email to