We are using Hadoop 0.20 and mapred.map.output.compression.Codec is set to DefaultCodec. We tried LZO but the performance seems very similar to DefaultCodec.
I heard of a lot of good words about LZO. So did anybody compare LZO with DefaultCodec? Is there a big difference? We are running CentOS release 5.2 (Final). Thanks, Zheng
