[
https://issues.apache.org/jira/browse/TEZ-3202?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15231145#comment-15231145
]
Jonathan Eagles commented on TEZ-3202:
--------------------------------------
[~rajesh.balamohan], Can you have a look at this change? This change makes a
huge difference in memory requirements for certain types of jobs. Not sure what
to do exactly with the findbugs warnings. Should I put them in the exclude
file?. Also, what do you think about reusing ByteArrayDataInput as opposed to
creating a dedicated purpose object? A dedicated object might be more clear and
easier to maintain.
> Reduce the memory need for jobs with high number of segments
> ------------------------------------------------------------
>
> Key: TEZ-3202
> URL: https://issues.apache.org/jira/browse/TEZ-3202
> Project: Apache Tez
> Issue Type: Bug
> Reporter: Jonathan Eagles
> Assignee: Jonathan Eagles
> Attachments: TEZ-3202.1.patch, TEZ-3202.2.patch, TEZ-3202.3.patch
>
>
> Segment has a 'key' member that holds accounting information to the reader's
> current key buffer, position, and length. There is a 384 byte overhead per
> segment since the account is done with the DataInputBuffer class which
> derives from DataInputStream which has underlying byte[80] and char[80] among
> significant pieces. This jira aims to reduce the overhead per segment
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)