[ https://issues.apache.org/jira/browse/YARN-9621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16864042#comment-16864042 ]
Prabhu Joseph commented on YARN-9621:
-------------------------------------
This time the test failed because the AM container exceeded its physical memory limit and was killed by the NodeManager.
{code:java}
2019-06-14 12:23:10,650 WARN [Container Monitor] monitor.ContainersMonitorImpl
(ContainersMonitorImpl.java:checkLimit(702)) - Container
[pid=23023,containerID=container_1560514968391_0001_02_000001] is running
60817408B beyond the 'PHYSICAL' memory limit. Current usage: 186 MB of 128 MB
physical memory used; 1.8 GB of 268.8 MB virtual memory used. Killing
container. Dump of the process-tree for container_1560514968391_0001_02_000001
: |- PID PPID PGRPID SESSID CMD_NAME USER_MODE_TIME(MILLIS) SYSTEM_TIME(MILLIS)
VMEM_USAGE(BYTES) RSSMEM_USAGE(PAGES) FULL_CMD_LINE |- 23032 23023 23023 23023
(java) 1010 42 1903222784 46888 /usr/lib/jvm/java-8-openjdk-amd64/jre/bin/java
-Xmx100m org.apache.hadoop.yarn.applications.distributedshell.ApplicationMaster
--container_type GUARANTEED --container_memory 10 --container_vcores 1
--num_containers 1 --placement_spec
ems9MSxOT1RJTixOT0RFLHprOnNwYXJrPTEsTk9USU4sTk9ERSx6aw== --priority 0 |- 23023
23021 23023 23023 (bash) 1 0 18731008 728 /bin/bash -c
/usr/lib/jvm/java-8-openjdk-amd64/jre/bin/java -Xmx100m
org.apache.hadoop.yarn.applications.distributedshell.ApplicationMaster
--container_type GUARANTEED --container_memory 10 --container_vcores 1
--num_containers 1 --placement_spec
ems9MSxOT1RJTixOT0RFLHprOnNwYXJrPTEsTk9USU4sTk9ERSx6aw== --priority 0
1>/testptch/hadoop/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-applications/hadoop-yarn-applications-distributedshell/target/TestDistributedShell/TestDistributedShell-logDir-nm-1_0/application_1560514968391_0001/container_1560514968391_0001_02_000001/AppMaster.stdout
2>/testptch/hadoop/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-applications/hadoop-yarn-applications-distributedshell/target/TestDistributedShell/TestDistributedShell-logDir-nm-1_0/application_1560514968391_0001/container_1560514968391_0001_02_000001/AppMaster.stderr
{code}
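The figures in this log line can be cross-checked with a few lines of arithmetic. The sketch below is only an illustration: the constants are copied from the log above, while the 4096-byte page size is an assumption (typical for x86-64 Linux, not stated in the log), and the function names are made up for this example. It also decodes the Base64 {{--placement_spec}} argument so the constraint the AM received is readable.

```python
import base64

# Values copied from the ContainersMonitorImpl log line above.
OVER_LIMIT_BYTES = 60817408   # "running 60817408B beyond the 'PHYSICAL' memory limit"
PMEM_LIMIT_MB = 128           # "186 MB of 128 MB physical memory used"
VMEM_LIMIT_MB = 268.8         # "1.8 GB of 268.8 MB virtual memory used"
PLACEMENT_SPEC_B64 = "ems9MSxOT1RJTixOT0RFLHprOnNwYXJrPTEsTk9USU4sTk9ERSx6aw=="

# Process-tree dump: name -> (VMEM_USAGE in bytes, RSSMEM_USAGE in pages).
PROCESS_TREE = {
    "java": (1903222784, 46888),
    "bash": (18731008, 728),
}

# Assumption (not in the log): 4 KiB pages, as on typical x86-64 Linux.
PAGE_SIZE_BYTES = 4096
MIB = 1024 * 1024


def total_rss_bytes(tree, page_size=PAGE_SIZE_BYTES):
    """Sum the resident set size of every process in the tree, in bytes."""
    return sum(pages for _, pages in tree.values()) * page_size


def total_vmem_bytes(tree):
    """Sum the virtual memory of every process in the tree, in bytes."""
    return sum(vmem for vmem, _ in tree.values())


def decode_placement_spec(encoded):
    """Decode the Base64 value passed to --placement_spec."""
    return base64.b64decode(encoded).decode("utf-8")


if __name__ == "__main__":
    rss = total_rss_bytes(PROCESS_TREE)
    limit = PMEM_LIMIT_MB * MIB
    print("RSS of tree (bytes):", rss)
    print("RSS of tree (MB):", rss / MIB)
    print("bytes over limit:", rss - limit)
    print("matches log:", rss - limit == OVER_LIMIT_BYTES)
    print("vmem of tree (GB):", round(total_vmem_bytes(PROCESS_TREE) / 1024**3, 2))
    print("vmem/pmem limit ratio:", round(VMEM_LIMIT_MB / PMEM_LIMIT_MB, 2))
    print("placement spec:", decode_placement_spec(PLACEMENT_SPEC_B64))
```

Under the 4 KiB page assumption, the two processes together hold 47616 resident pages, which is 195035136 bytes, or exactly 186 MB. That is 60817408 bytes above the 128 MB limit, matching the log. The java process alone accounts for about 183 MB of RSS even though it runs with -Xmx100m: that flag caps only the Java heap, so non-heap memory (metaspace, thread stacks, code cache and so on) still counts against the container. The 268.8 MB virtual limit is 2.1 times the physical limit, which matches the default yarn.nodemanager.vmem-pmem-ratio. The placement spec decodes to zk=1,NOTIN,NODE,zk:spark=1,NOTIN,NODE,zk.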
> FIX TestDSWithMultipleNodeManager.testDistributedShellWithPlacementConstraint
> on branch-3.1
> -------------------------------------------------------------------------------------------
>
> Key: YARN-9621
> URL: https://issues.apache.org/jira/browse/YARN-9621
> Project: Hadoop YARN
> Issue Type: Bug
> Components: distributed-shell, test
> Affects Versions: 3.1.2
> Reporter: Peter Bacsko
> Assignee: Prabhu Joseph
> Priority: Major
> Attachments: YARN-9621-branch-3.1.001.patch,
> YARN-9621-branch-3.1.002.patch, YARN-9621-branch-3.1.003.patch
>
>
> The test case
> {{TestDSWithMultipleNodeManager.testDistributedShellWithPlacementConstraint}}
> seems to fail consistently on branch-3.1. I believe the failure was introduced
> by YARN-9253.
> {noformat}
> testDistributedShellWithPlacementConstraint(org.apache.hadoop.yarn.applications.distributedshell.TestDSWithMultipleNodeManager) Time elapsed: 24.636 s <<< FAILURE!
> java.lang.AssertionError: expected:<1> but was:<2>
> at org.junit.Assert.fail(Assert.java:88)
> at org.junit.Assert.failNotEquals(Assert.java:743)
> at org.junit.Assert.assertEquals(Assert.java:118)
> at org.junit.Assert.assertEquals(Assert.java:555)
> at org.junit.Assert.assertEquals(Assert.java:542)
> at org.apache.hadoop.yarn.applications.distributedshell.TestDSWithMultipleNodeManager.testDistributedShellWithPlacementConstraint(TestDSWithMultipleNodeManager.java:178)
> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
> at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> at java.lang.reflect.Method.invoke(Method.java:498)
> at org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:47)
> at org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
> at org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:44)
> at org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
> at org.junit.internal.runners.statements.FailOnTimeout$StatementThread.run(FailOnTimeout.java:74)
> {noformat}