Eli Acherkan created HADOOP-14376:
-------------------------------------

             Summary: Memory leak when reading a bzip2-compressed file using 
the native library
                 Key: HADOOP-14376
                 URL: https://issues.apache.org/jira/browse/HADOOP-14376
             Project: Hadoop Common
          Issue Type: Bug
          Components: common, io
    Affects Versions: 2.7.0
            Reporter: Eli Acherkan


Opening and closing a large number of bzip2-compressed input streams causes the 
process to be killed on OutOfMemory when using the native bzip2 library.

Our initial analysis suggests that this can be caused by {{DecompressorStream}} 
overriding the {{close()}} method, and therefore skipping the line from its 
parent: {{CodecPool.returnDecompressor(trackedDecompressor)}}. When the 
decompressor object is a {{Bzip2Decompressor}}, its native {{end()}} method is 
never called, and the allocated memory isn't freed.

If this analysis is correct, the simplest way to fix this bug would be to 
replace {{in.close()}} with {{super.close()}} in {{DecompressorStream}}.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

---------------------------------------------------------------------
To unsubscribe, e-mail: common-dev-unsubscr...@hadoop.apache.org
For additional commands, e-mail: common-dev-h...@hadoop.apache.org

Reply via email to