[jira] [Resolved] (YARN-501) Application Master getting killed randomly reporting excess usage of memory
[ https://issues.apache.org/jira/browse/YARN-501?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinod Kumar Vavilapalli resolved YARN-501. -- Resolution: Not A Problem Haven't gotten a response on my last comment in a while. IAC, it is unlikely YARN can do much in this situation. Closing this again as not-a-problem. Application Master getting killed randomly reporting excess usage of memory --- Key: YARN-501 URL: https://issues.apache.org/jira/browse/YARN-501 Project: Hadoop YARN Issue Type: Bug Components: applications/distributed-shell, nodemanager Affects Versions: 2.0.3-alpha Reporter: Krishna Kishore Bonagiri Assignee: Omkar Vinit Joshi I am running a date command using the Distributed Shell example in a loop of 500 times. It ran successfully all the times except one time where it gave the following error. 2013-03-22 04:33:25,280 INFO [main] distributedshell.Client (Client.java:monitorApplication(605)) - Got application report from ASM for, appId=222, clientToken=null, appDiagnostics=Application application_1363938200742_0222 failed 1 times due to AM Container for appattempt_1363938200742_0222_01 exited with exitCode: 143 due to: Container [pid=21141,containerID=container_1363938200742_0222_01_01] is running beyond virtual memory limits. Current usage: 47.3 Mb of 128 Mb physical memory used; 611.6 Mb of 268.8 Mb virtual memory used. Killing container. Dump of the process-tree for container_1363938200742_0222_01_01 : |- PID PPID PGRPID SESSID CMD_NAME USER_MODE_TIME(MILLIS) SYSTEM_TIME(MILLIS) VMEM_USAGE(BYTES) RSSMEM_USAGE(PAGES) FULL_CMD_LINE |- 21147 21141 21141 21141 (java) 244 12 532643840 11802 /home_/dsadm/yarn/jdk//bin/java -Xmx128m org.apache.hadoop.yarn.applications.distributedshell.ApplicationMaster --container_memory 10 --num_containers 2 --priority 0 --shell_command date |- 21141 8433 21141 21141 (bash) 0 0 108642304 298 /bin/bash -c /home_/dsadm/yarn/jdk//bin/java -Xmx128m org.apache.hadoop.yarn.applications.distributedshell.ApplicationMaster --container_memory 10 --num_containers 2 --priority 0 --shell_command date 1/tmp/logs/application_1363938200742_0222/container_1363938200742_0222_01_01/AppMaster.stdout 2/tmp/logs/application_1363938200742_0222/container_1363938200742_0222_01_01/AppMaster.stderr -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Resolved] (YARN-501) Application Master getting killed randomly reporting excess usage of memory
[ https://issues.apache.org/jira/browse/YARN-501?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ravi Prakash resolved YARN-501. --- Resolution: Not A Problem Hi Krishna, Please set yarn.app.mapreduce.am.command-opts to include a more reasonable -Xmx. Or perhaps increase yarn.nodemanager.vmem-pmem-ratio to a higher ratio if you don't need that much physical memory. Application Master getting killed randomly reporting excess usage of memory --- Key: YARN-501 URL: https://issues.apache.org/jira/browse/YARN-501 Project: Hadoop YARN Issue Type: Bug Components: applications/distributed-shell, nodemanager Affects Versions: 2.0.3-alpha Reporter: Krishna Kishore Bonagiri I am running a date command using the Distributed Shell example in a loop of 500 times. It ran successfully all the times except one time where it gave the following error. 2013-03-22 04:33:25,280 INFO [main] distributedshell.Client (Client.java:monitorApplication(605)) - Got application report from ASM for, appId=222, clientToken=null, appDiagnostics=Application application_1363938200742_0222 failed 1 times due to AM Container for appattempt_1363938200742_0222_01 exited with exitCode: 143 due to: Container [pid=21141,containerID=container_1363938200742_0222_01_01] is running beyond virtual memory limits. Current usage: 47.3 Mb of 128 Mb physical memory used; 611.6 Mb of 268.8 Mb virtual memory used. Killing container. Dump of the process-tree for container_1363938200742_0222_01_01 : |- PID PPID PGRPID SESSID CMD_NAME USER_MODE_TIME(MILLIS) SYSTEM_TIME(MILLIS) VMEM_USAGE(BYTES) RSSMEM_USAGE(PAGES) FULL_CMD_LINE |- 21147 21141 21141 21141 (java) 244 12 532643840 11802 /home_/dsadm/yarn/jdk//bin/java -Xmx128m org.apache.hadoop.yarn.applications.distributedshell.ApplicationMaster --container_memory 10 --num_containers 2 --priority 0 --shell_command date |- 21141 8433 21141 21141 (bash) 0 0 108642304 298 /bin/bash -c /home_/dsadm/yarn/jdk//bin/java -Xmx128m org.apache.hadoop.yarn.applications.distributedshell.ApplicationMaster --container_memory 10 --num_containers 2 --priority 0 --shell_command date 1/tmp/logs/application_1363938200742_0222/container_1363938200742_0222_01_01/AppMaster.stdout 2/tmp/logs/application_1363938200742_0222/container_1363938200742_0222_01_01/AppMaster.stderr -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira