Can you give a little more details?
For example, you tried a single .bz file as input, and the pig job has 2 or
more mappers?

I didn't know bz2 was splittable.

Zheng
On Tue, Dec 2, 2008 at 1:18 AM, Josh Ferguson <[EMAIL PROTECTED]> wrote:

> It is splittable because of how the compression uses blocks, Pig does this
> out of the box.
>
> Josh
>
>
> On Dec 2, 2008, at 1:14 AM, Zheng Shao wrote:
>
>  It shouldn't be a problem for Hive to support it (by defining your own
>> input/output file format that does the decompression on the flyer), but we
>> won't be able to parallelize the execution as we do with uncompressed text
>> files, and sequence files, since bz2 compression is not splittable.
>>
>
>


-- 
Yours,
Zheng

Reply via email to