Robert Joseph Evans created MAPREDUCE-5082:
----------------------------------------------

             Summary: CodecPool should avoid OOMs with buggy codecs
                 Key: MAPREDUCE-5082
                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5082
             Project: Hadoop Map/Reduce
          Issue Type: Bug
            Reporter: Robert Joseph Evans


I recently found a bug in the gpl compression libraries that was causing map 
tasks for a particular job to OOM.

https://github.com/omalley/hadoop-gpl-compression/issues/3

Now granted it does not make a lot of sense for a job to use the LzopCodec for 
map output compression over the LzoCodec, but arguably other codecs could be 
doing similar things and causing the same sort of memory leaks.  I propose that 
we do a sanity check when creating a new decompressor/compressor.  If the codec 
newly created object does not match the value from getType... it should turn 
off caching for that Codec.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to