Robert Joseph Evans created MAPREDUCE-5082:
----------------------------------------------
                 Summary: CodecPool should avoid OOMs with buggy codecs
                     Key: MAPREDUCE-5082
                     URL: https://issues.apache.org/jira/browse/MAPREDUCE-5082
                 Project: Hadoop Map/Reduce
              Issue Type: Bug
                Reporter: Robert Joseph Evans

I recently found a bug in the GPL compression libraries that was causing map tasks for a particular job to OOM.

https://github.com/omalley/hadoop-gpl-compression/issues/3

Granted, it does not make a lot of sense for a job to use the LzopCodec rather than the LzoCodec for map output compression, but other codecs could arguably be doing similar things and causing the same sort of memory leak.

I propose that we do a sanity check when creating a new decompressor/compressor. If the class of the object the codec has just created does not match the value from getType..., CodecPool should turn off caching for that codec.
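
The proposed sanity check might look roughly like the sketch below. It is a minimal, self-contained illustration and not Hadoop's actual CodecPool code: the Compressor and Codec interfaces, the demo classes, and every method name other than the general idea of a "declared type" getter are hypothetical stand-ins. It assumes the pool looks up idle objects by the type the codec declares but stores returned objects under their concrete class, so a codec whose declared type differs from what it really creates would never get its objects reused and the pool would only grow.

```java
import java.util.ArrayDeque;
import java.util.Deque;
import java.util.HashMap;
import java.util.HashSet;
import java.util.Map;
import java.util.Set;

/** Hypothetical pool sketch; names and structure are assumptions, not Hadoop's API. */
final class CodecPoolSketch {
    /** Stand-in for a compressor object that is expensive to create. */
    interface Compressor {}

    /** Stand-in for a codec that declares the type it creates. */
    interface Codec {
        Class<? extends Compressor> getCompressorType();
        Compressor createCompressor();
    }

    static class BaseCompressor implements Compressor {}
    static class DerivedCompressor extends BaseCompressor {}

    /** Declares exactly the class it creates. */
    static class HonestCodec implements Codec {
        public Class<? extends Compressor> getCompressorType() { return BaseCompressor.class; }
        public Compressor createCompressor() { return new BaseCompressor(); }
    }

    /** Declares one class but creates another, like the bug described above. */
    static class BuggyCodec implements Codec {
        public Class<? extends Compressor> getCompressorType() { return BaseCompressor.class; }
        public Compressor createCompressor() { return new DerivedCompressor(); }
    }

    // Idle compressors, keyed by concrete class.
    private final Map<Class<?>, Deque<Compressor>> idle = new HashMap<>();
    // Concrete classes that failed the sanity check; never cached.
    private final Set<Class<?>> uncacheable = new HashSet<>();

    synchronized Compressor getCompressor(Codec codec) {
        Class<? extends Compressor> declared = codec.getCompressorType();
        Deque<Compressor> available = idle.get(declared);
        if (available != null && !available.isEmpty()) {
            return available.pop();
        }
        Compressor created = codec.createCompressor();
        // Sanity check: the created object must be exactly the declared type,
        // otherwise a later lookup by declared type would never find it.
        if (created.getClass() != declared) {
            uncacheable.add(created.getClass());
        }
        return created;
    }

    synchronized void returnCompressor(Compressor compressor) {
        if (compressor == null || uncacheable.contains(compressor.getClass())) {
            return; // drop it and let the garbage collector reclaim it
        }
        idle.computeIfAbsent(compressor.getClass(), k -> new ArrayDeque<>()).push(compressor);
    }

    /** Total number of idle compressors currently held by the pool. */
    synchronized int idleCount() {
        return idle.values().stream().mapToInt(Deque::size).sum();
    }
}
```

In this sketch caching is disabled per concrete compressor class rather than per codec, which is a simplification of the proposal. Without the check, each borrow/return cycle through BuggyCodec would add one more DerivedCompressor to the pool that no later borrow could ever retrieve.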