Krishna Kishore Bonagiri created YARN-501:
---------------------------------------------
Summary: Application Master getting killed randomly reporting
excess usage of memory
Key: YARN-501
URL: https://issues.apache.org/jira/browse/YARN-501
Project: Hadoop YARN
Issue Type: Bug
Components: applications/distributed-shell, nodemanager
Affects Versions: 2.0.3-alpha
Reporter: Krishna Kishore Bonagiri
I am running a date command using the Distributed Shell example in a loop of
500 times. It ran successfully all the times except one time where it gave the
following error.
2013-03-22 04:33:25,280 INFO [main] distributedshell.Client
(Client.java:monitorApplication(605)) - Got application report from ASM for,
appId=222, clientToken=null, appDiagnostics=Application
application_1363938200742_0222 failed 1 times due to AM Container for
appattempt_1363938200742_0222_000001 exited with exitCode: 143 due to:
Container [pid=21141,containerID=container_1363938200742_0222_01_000001] is
running beyond virtual memory limits. Current usage: 47.3 Mb of 128 Mb physical
memory used; 611.6 Mb of 268.8 Mb virtual memory used. Killing container.
Dump of the process-tree for container_1363938200742_0222_01_000001 :
|- PID PPID PGRPID SESSID CMD_NAME USER_MODE_TIME(MILLIS)
SYSTEM_TIME(MILLIS) VMEM_USAGE(BYTES) RSSMEM_USAGE(PAGES) FULL_CMD_LINE
|- 21147 21141 21141 21141 (java) 244 12 532643840 11802
/home_/dsadm/yarn/jdk//bin/java -Xmx128m
org.apache.hadoop.yarn.applications.distributedshell.ApplicationMaster
--container_memory 10 --num_containers 2 --priority 0 --shell_command date
|- 21141 8433 21141 21141 (bash) 0 0 108642304 298 /bin/bash -c
/home_/dsadm/yarn/jdk//bin/java -Xmx128m
org.apache.hadoop.yarn.applications.distributedshell.ApplicationMaster
--container_memory 10 --num_containers 2 --priority 0 --shell_command date
1>/tmp/logs/application_1363938200742_0222/container_1363938200742_0222_01_000001/AppMaster.stdout
2>/tmp/logs/application_1363938200742_0222/container_1363938200742_0222_01_000001/AppMaster.stderr
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira