Koji Noguchi created PIG-3251:
---------------------------------

             Summary: Bzip2TextInputFormat requires double the memory of maximum record size
                 Key: PIG-3251
                 URL: https://issues.apache.org/jira/browse/PIG-3251
             Project: Pig
          Issue Type: Improvement
            Reporter: Koji Noguchi
            Assignee: Koji Noguchi
            Priority: Minor


While looking at a user's OOM heap dump, I noticed that Pig's Bzip2TextInputFormat 
holds memory in two places at once:

Bzip2TextInputFormat.buffer (a ByteArrayOutputStream) 
and the actual Text that is returned as the line.

For example, with a single 160 MB record, the buffer was 268 MB and the 
Text was 160 MB.
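
The 268 MB figure is consistent with a buffer that grows by doubling its 
capacity: 268,435,456 bytes is 2^28, the first power of two above 160 MB. 
The sketch below only reproduces that arithmetic. It assumes the buffer 
starts at 32 bytes (the ByteArrayOutputStream default) and is filled in 
small writes; the class and method names are hypothetical, and this is 
not the actual Bzip2TextInputFormat code.

```java
// Hypothetical sketch: models a buffer whose capacity doubles when full.
// It does not allocate real memory and does not use Pig or Hadoop classes.
class BufferGrowthSketch {

    /**
     * Capacity a doubling buffer ends up with after recordBytes bytes
     * have been appended in small writes, starting from initialCapacity.
     */
    static long capacityAfter(long recordBytes, long initialCapacity) {
        long capacity = initialCapacity;
        while (capacity < recordBytes) {
            capacity *= 2; // grow by doubling whenever the buffer is full
        }
        return capacity;
    }

    /** Bytes held when the buffer and a second full copy of the record coexist. */
    static long peakBytes(long recordBytes, long initialCapacity) {
        return capacityAfter(recordBytes, initialCapacity) + recordBytes;
    }

    public static void main(String[] args) {
        long record = 160L * 1000 * 1000; // assumed size of the 160 MB record
        System.out.println("buffer capacity: " + capacityAfter(record, 32));
        System.out.println("peak with second copy: " + peakBytes(record, 32));
    }
}
```

Under these assumptions the buffer alone holds 268,435,456 bytes, and 
buffer plus the returned copy hold about 428 MB for one 160 MB record.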

We can probably eliminate one of them.

