Can you give a little more detail? For example, did you try a single .bz2 file as input and see the Pig job run with 2 or more mappers?
I didn't know bz2 was splittable.

Zheng

On Tue, Dec 2, 2008 at 1:18 AM, Josh Ferguson <[EMAIL PROTECTED]> wrote:

> It is splittable because of how the compression uses blocks. Pig does this
> out of the box.
>
> Josh
>
>
> On Dec 2, 2008, at 1:14 AM, Zheng Shao wrote:
>
>> It shouldn't be a problem for Hive to support it (by defining your own
>> input/output file format that does the decompression on the fly), but we
>> won't be able to parallelize the execution as we do with uncompressed text
>> files and sequence files, since bz2 compression is not splittable.
>

--
Yours,
Zheng
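
For anyone following the thread, Josh's point about blocks can be checked with a small sketch. It assumes only the published bzip2 format: every compressed block starts with the 48-bit marker 0x314159265359, and blocks are bit-aligned rather than byte-aligned. The sample data and the helper name block_bit_offsets are made up for illustration. This is not how Pig or Hadoop implement splitting; it only shows that a reader could locate block boundaries in the middle of a stream.

```python
import bz2

BLOCK_MAGIC = 0x314159265359  # 48-bit marker at the start of every bzip2 block
MASK = (1 << 48) - 1


def block_bit_offsets(compressed: bytes) -> list:
    """Return the bit offset of each block-start marker in a bzip2 stream."""
    offsets = []
    window = 0      # sliding window holding the last 48 bits seen
    bit_index = 0   # number of bits consumed so far
    for byte in compressed:
        for shift in range(7, -1, -1):  # most significant bit first
            window = ((window << 1) | ((byte >> shift) & 1)) & MASK
            bit_index += 1
            if bit_index >= 48 and window == BLOCK_MAGIC:
                offsets.append(bit_index - 48)
    return offsets


# A few hundred KB of made-up log lines (hypothetical sample data).
# compresslevel=1 selects the smallest block size (about 100 KB of input),
# so this input is spread over several blocks.
data = b"".join(b"%d\tsample log line\n" % i for i in range(20000))
compressed = bz2.compress(data, compresslevel=1)

offsets = block_bit_offsets(compressed)
print("blocks found:", len(offsets))
print("first block starts at bit:", offsets[0])
```

The first marker sits right after the 4-byte stream header ("BZh" plus the level digit). The later markers are the candidate points where a splitter could start decoding, which is why a single large bz2 file can in principle feed more than one mapper.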
