[ 
https://issues.apache.org/jira/browse/TEZ-3634?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Siddharth Seth updated TEZ-3634:
--------------------------------
    Attachment: TEZ-3634.01.patch

[~rajesh.balamohan], [~gopalv] - could you please take a look.

> reduce the buffer sizes in PipelinedSorter by a small amount
> ------------------------------------------------------------
>
>                 Key: TEZ-3634
>                 URL: https://issues.apache.org/jira/browse/TEZ-3634
>             Project: Apache Tez
>          Issue Type: Improvement
>            Reporter: Siddharth Seth
>            Assignee: Siddharth Seth
>         Attachments: TEZ-3634.01.patch
>
>
> When running in large JVMs with the G1GC, the regionsize maxes out at 32MB.
> PipelinedSorter does allocations of 32MB and 256MB. The actual allocation 
> ends up being 24 bytes over 32M/256M. As a result the 32M allocation ends up 
> taking 2 regions (or 64M), and the 256M ends up taking 9 regions instead of 
> 8. Also leads to fragmentation of the heap, resulting in more Full GCs than 
> required.
> {code}
> Live humongous region 2 size 33554456 start 0x00007f4918000000 length 2 with 
> remset 0 code roots 0 is marked 0 reclaim candidate 0 type array 1
> {code}
> {code}
> Dead humongous region 111 size 268435480 start 0x00007f49f2000000 length 9 
> with remset 0 code roots 0 is marked 0 reclaim candidate 1 type array 1
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Reply via email to