Re: Bzip2 vs Gzip

2013-09-17 Thread Rahul Bhattacharjee
Yes, bzip2 is splittable. As for the tradeoffs, I have not done much experimentation with codecs.

Thanks,
Rahul

On Wed, Sep 18, 2013 at 2:07 AM, Amit Sela wrote:
> Hi all,
> I'm using hadoop 1.0.4 and using gzip to keep the logs processed by hadoop
> (logs are gzipped into block size files).
> I read t

Bzip2 vs Gzip

2013-09-17 Thread Amit Sela
Hi all, I'm using Hadoop 1.0.4 and using gzip to keep the logs processed by Hadoop (logs are gzipped into block-size files). I read that bzip2 is splittable. Is that so in Hadoop 1.0.4? Does that mean that any input file bigger than the block size will be split between maps? What are the tradeoffs betw
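
For anyone who wants to measure the tradeoff on their own data, here is a minimal sketch in Python using only the standard library `gzip` and `bz2` modules. It does not use Hadoop at all. The sample log lines, the helper names (`make_sample_log`, `measure`, `approx_map_tasks`) and the 64 MB block size are assumptions made for illustration, and the task count is a rough estimate of one map per block, not Hadoop's exact split logic.

```python
import bz2
import gzip
import math
import time


def make_sample_log(lines=20000):
    """Build synthetic log data (hypothetical format, for illustration only)."""
    return "".join(
        f"2013-09-17 12:00:{i % 60:02d} INFO request id={i} status=200\n"
        for i in range(lines)
    ).encode("utf-8")


def measure(name, compress, data):
    """Compress data once and report compressed size, ratio and elapsed time."""
    start = time.perf_counter()
    packed = compress(data)
    elapsed = time.perf_counter() - start
    return {
        "codec": name,
        "bytes": len(packed),
        "ratio": len(data) / len(packed),
        "seconds": elapsed,
    }


def approx_map_tasks(file_size, block_size, splittable):
    """Rough estimate: one map per block if the format can be split, else one map."""
    if not splittable:
        return 1
    return max(1, math.ceil(file_size / block_size))


if __name__ == "__main__":
    data = make_sample_log()
    for result in (
        measure("gzip", gzip.compress, data),
        measure("bzip2", bz2.compress, data),
    ):
        print(result)

    block = 64 * 1024 * 1024  # assumed block size
    print("unsplittable 256 MB file:", approx_map_tasks(4 * block, block, False))
    print("splittable 256 MB file:", approx_map_tasks(4 * block, block, True))
```

Running it on a sample of real logs rather than the synthetic lines should give a more honest picture of the size and CPU cost of each codec. Whether a file larger than the block size actually gets more than one map depends on the codec being splittable in the Hadoop version in use.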