Hi, I am using hadoop-0.20.1. When I have a bzip2 file larger than the configured block size, only a single mapper gets launched, which clearly shows that the splittable feature of bzip2 is not being used.
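For reference, here is a rough sketch of how many map tasks one would expect if the file were actually being split. It assumes a 64 MB block size (the configured value is not stated above) and imitates the 10% slack that FileInputFormat allows on the last split. The class and method names are made up for illustration and are not part of Hadoop.

```java
public class SplitEstimate {
    // Slack factor imitating the 10% tolerance FileInputFormat allows on the
    // final split (an assumption for this illustration).
    static final double SPLIT_SLOP = 1.1;

    // Estimates how many input splits (and hence map tasks) a splittable file
    // of fileBytes would produce with the given split size.
    // An empty file is treated as producing no splits in this sketch.
    static long countSplits(long fileBytes, long splitBytes) {
        if (fileBytes <= 0) {
            return 0;
        }
        long remaining = fileBytes;
        long splits = 0;
        // Carve off full-size splits while more than 1.1 split sizes remain.
        while ((double) remaining / splitBytes > SPLIT_SLOP) {
            remaining -= splitBytes;
            splits++;
        }
        // Whatever is left becomes the last split.
        if (remaining > 0) {
            splits++;
        }
        return splits;
    }

    public static void main(String[] args) {
        long mb = 1024L * 1024L;
        long blockSize = 64 * mb; // assumed block size, not taken from the post

        System.out.println("2 GB file:   " + countSplits(2048 * mb, blockSize) + " splits");
        System.out.println("500 MB file: " + countSplits(500 * mb, blockSize) + " splits");
    }
}
```

Under these assumptions a 2 GB file would give 32 splits and a 500 MB file would give 8, rather than the single mapper I am seeing.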
But when I use my own InputFormat, say SafeInputFormat (extends FileInputFormat), and make isSplitable return true, multiple mappers are executed, but the job fails when the reducers reach 33% for large bzip2 files (on the order of 2 GB). The same setup works well with smaller bzip2 files (on the order of 500 MB). Kindly help me find a workable solution here.

Thanks & regards,
- Deepak Diwakar