[
https://issues.apache.org/jira/browse/MAPREDUCE-6569?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15048602#comment-15048602
]
Mohammad Shahid Khan commented on MAPREDUCE-6569:
-------------------------------------------------
As per log the am is killed not because of the memory.
Please check in the log do you message like : "Expired: container_id Timed out
after 600 secs"
Any AM that doesn’t heartbeat within a configured interval of time, by default
10 minutes, is deemed dead and is expired by the RM.
Can you get the thread dump of the AMContainer to check why it is stucked.
> ApplicationMaster Stuck 10 min And RM kill the AM
> --------------------------------------------------
>
> Key: MAPREDUCE-6569
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-6569
> Project: Hadoop Map/Reduce
> Issue Type: Bug
> Components: applicationmaster
> Affects Versions: 2.2.0
> Environment: CentOS6.5 Hadoop-2.2.0 DefaultContainer
> Reporter: zhangyubiao
> Attachments: 297586_0001.log
>
>
> ApplicationMaster Stuck 10 min
> and printLogs
> 2015-12-09 02:45:04,160 INFO
> org.apache.hadoop.yarn.server.nodemanager.containermanager.monitor.ContainersMonitorImpl:
> Memory usage of ProcessTree 30634 for container-
> id container_1449229056278_297586_01_000001: 1.1 GB of 3 GB physical memory
> used; 3.5 GB of 9.3 GB virtual memory used
> 2015-12-09 02:45:07,527 INFO
> org.apache.hadoop.yarn.server.nodemanager.containermanager.monitor.ContainersMonitorImpl:
> Memory usage of ProcessTree 30634 for container-
> id container_1449229056278_297586_01_000001: 1.1 GB of 3 GB physical memory
> used; 3.5 GB of 9.3 GB virtual memory used
> 2015-12-09 02:45:10,888 INFO
> org.apache.hadoop.yarn.server.nodemanager.containermanager.monitor.ContainersMonitorImpl:
> Memory usage of ProcessTree 30634 for container-
> id container_1449229056278_297586_01_000001: 1.1 GB of 3 GB physical memory
> used; 3.5 GB of 9.3 GB virtual memory used
> 2015-12-09 02:45:14,274 INFO
> org.apache.hadoop.yarn.server.nodemanager.containermanager.monitor.ContainersMonitorImpl:
> Memory usage of ProcessTree 30634 for container-
> id container_1449229056278_297586_01_000001: 1.1 GB of 3 GB physical memory
> used; 3.5 GB of 9.3 GB virtual memory used
> 2015-12-09 02:45:17,625 INFO
> org.apache.hadoop.yarn.server.nodemanager.containermanager.monitor.ContainersMonitorImpl:
> Memory usage of ProcessTree 30634 for container-
> id container_1449229056278_297586_01_000001: 1.1 GB of 3 GB physical memory
> used; 3.5 GB of 9.3 GB virtual memory used
> And RM kill the AM
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)