[jira] [Commented] (PIG-4043) JobClient.getMap/ReduceTaskReports() causes OOM for jobs with a large number of tasks

Rohini Palaniswamy (JIRA) Sun, 29 Jun 2014 20:18:07 -0700

    [ 
https://issues.apache.org/jira/browse/PIG-4043?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14047334#comment-14047334
 ]


Rohini Palaniswamy commented on PIG-4043:
-----------------------------------------

+1. 

Changing getAvgREduceTime() to getAvgReduceTime() breaks Oozie. So need to keep 
it as it is. Can you revert that part of the patch and check in. 

> JobClient.getMap/ReduceTaskReports() causes OOM for jobs with a large number 
> of tasks
> -------------------------------------------------------------------------------------
>
>                 Key: PIG-4043
>                 URL: https://issues.apache.org/jira/browse/PIG-4043
>             Project: Pig
>          Issue Type: Bug
>            Reporter: Cheolsoo Park
>            Assignee: Cheolsoo Park
>             Fix For: 0.14.0
>
>         Attachments: PIG-4043-1.patch, PIG-4043-2.patch, PIG-4043-3.patch, 
> PIG-4043-4.patch, heapdump.png
>
>
> With Hadoop 2.4, I often see Pig client fails due to OOM when there are many 
> tasks (~100K) with 1GB heap size.
> The heap dump (attached) shows that TaskReport[] occupies about 80% of heap 
> space at the time of OOM.
> The problem is that JobClient.getMap/ReduceTaskReports() returns an array of 
> TaskReport objects, which can be huge if the number of task is large.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Commented] (PIG-4043) JobClient.getMap/ReduceTaskReports() causes OOM for jobs with a large number of tasks

Reply via email to