Temporary map-output files don't use the Avro format. They use a custom format, 
which should be compressed if you set mapred.compress.map.output.
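
A minimal sketch of enabling this through the old mapred API (the one the Avro 
mapred classes build on) might look like the following. The class name 
SpillCompressionExample is hypothetical, and DefaultCodec (zlib/Deflate) is only 
an assumed codec choice, not something confirmed in this thread. The typed 
setters on JobConf are used so the example does not depend on the exact spelling 
of the property key.

```java
import org.apache.hadoop.io.compress.DefaultCodec;
import org.apache.hadoop.mapred.JobConf;

// Hypothetical class name, used only for illustration.
public class SpillCompressionExample {
    public static void main(String[] args) {
        JobConf conf = new JobConf(SpillCompressionExample.class);

        // Ask the framework to compress intermediate map output.
        conf.setCompressMapOutput(true);

        // Assumption: DefaultCodec (zlib/Deflate) as the codec; any
        // CompressionCodec available on the cluster could be used instead.
        conf.setMapOutputCompressorClass(DefaultCodec.class);

        // Read the values back to confirm what this JobConf carries.
        System.out.println("compress map output: " + conf.getCompressMapOutput());
        System.out.println("codec: "
            + conf.getMapOutputCompressorClass(DefaultCodec.class).getName());
    }
}
```

The settings only take effect if they are applied to the same JobConf that is 
submitted, and before the job is submitted.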

Arun

On Jan 12, 2012, at 8:08 AM, Frank Grimes wrote:

> I tried conf.setBoolean("mapred.compress.map.output", true); but it didn't 
> seem to work.
> 
> Also, since I'm using the Avro mapred APIs, maybe there's something Avro 
> specific to get it enabled?
> Should I ask on the Avro mailing lists? 
> 
> Thanks,
> 
> Frank Grimes
> 
> 
> On 2012-01-12, at 10:49 AM, bejoy.had...@gmail.com wrote:
> 
>> Hi Frank
>>      Is map output compression enabled?
>> 
>> The config param would be like
>> mapred.map.output.compress=true
>> (This is from memory, so please cross-check.)
>> 
>> ------Original Message------
>> From: Frank Grimes
>> To: hdfs-user@hadoop.apache.org
>> ReplyTo: hdfs-user@hadoop.apache.org
>> Subject: Can spill to disk be in compressed format to reduce I/O?
>> Sent: Jan 12, 2012 21:10
>> 
>> Hi All,
>> 
>> We're trying to speed up an M/R job which combines multiple .avro files.
>> We've noticed that when it spills to disk, it's in uncompressed format.
>> Is there a way to make it spill temporary segments as .avro with Deflate 
>> compression?
>> 
>> Thanks,
>> 
>> Frank Grimes
>> 
>> Regards
>> Bejoy K S
> 
