[ 
https://issues.apache.org/jira/browse/YARN-9621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16864042#comment-16864042
 ] 

Prabhu Joseph commented on YARN-9621:
-------------------------------------

This time it got failed with AM container exceeding physical memory.

 
{code:java}
2019-06-14 12:23:10,650 WARN [Container Monitor] monitor.ContainersMonitorImpl 
(ContainersMonitorImpl.java:checkLimit(702)) - Container 
[pid=23023,containerID=container_1560514968391_0001_02_000001] is running 
60817408B beyond the 'PHYSICAL' memory limit. Current usage: 186 MB of 128 MB 
physical memory used; 1.8 GB of 268.8 MB virtual memory used. Killing 
container. Dump of the process-tree for container_1560514968391_0001_02_000001 
: |- PID PPID PGRPID SESSID CMD_NAME USER_MODE_TIME(MILLIS) SYSTEM_TIME(MILLIS) 
VMEM_USAGE(BYTES) RSSMEM_USAGE(PAGES) FULL_CMD_LINE |- 23032 23023 23023 23023 
(java) 1010 42 1903222784 46888 /usr/lib/jvm/java-8-openjdk-amd64/jre/bin/java 
-Xmx100m org.apache.hadoop.yarn.applications.distributedshell.ApplicationMaster 
--container_type GUARANTEED --container_memory 10 --container_vcores 1 
--num_containers 1 --placement_spec 
ems9MSxOT1RJTixOT0RFLHprOnNwYXJrPTEsTk9USU4sTk9ERSx6aw== --priority 0 |- 23023 
23021 23023 23023 (bash) 1 0 18731008 728 /bin/bash -c 
/usr/lib/jvm/java-8-openjdk-amd64/jre/bin/java -Xmx100m 
org.apache.hadoop.yarn.applications.distributedshell.ApplicationMaster 
--container_type GUARANTEED --container_memory 10 --container_vcores 1 
--num_containers 1 --placement_spec 
ems9MSxOT1RJTixOT0RFLHprOnNwYXJrPTEsTk9USU4sTk9ERSx6aw== --priority 0 
1>/testptch/hadoop/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-applications/hadoop-yarn-applications-distributedshell/target/TestDistributedShell/TestDistributedShell-logDir-nm-1_0/application_1560514968391_0001/container_1560514968391_0001_02_000001/AppMaster.stdout
 
2>/testptch/hadoop/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-applications/hadoop-yarn-applications-distributedshell/target/TestDistributedShell/TestDistributedShell-logDir-nm-1_0/application_1560514968391_0001/container_1560514968391_0001_02_000001/AppMaster.stderr
 {code}

> FIX TestDSWithMultipleNodeManager.testDistributedShellWithPlacementConstraint 
> on branch-3.1
> -------------------------------------------------------------------------------------------
>
>                 Key: YARN-9621
>                 URL: https://issues.apache.org/jira/browse/YARN-9621
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: distributed-shell, test
>    Affects Versions: 3.1.2
>            Reporter: Peter Bacsko
>            Assignee: Prabhu Joseph
>            Priority: Major
>         Attachments: YARN-9621-branch-3.1.001.patch, 
> YARN-9621-branch-3.1.002.patch, YARN-9621-branch-3.1.003.patch
>
>
> Testcase 
> {{TestDSWithMultipleNodeManager.testDistributedShellWithPlacementConstraint}} 
> seems to constantly fail on branch 3.1. I believe it was introduced by 
> YARN-9253.
> {noformat}
> testDistributedShellWithPlacementConstraint(org.apache.hadoop.yarn.applications.distributedshell.TestDSWithMultipleNodeManager)
>   Time elapsed: 24.636 s  <<< FAILURE!
> java.lang.AssertionError: expected:<1> but was:<2>
>       at org.junit.Assert.fail(Assert.java:88)
>       at org.junit.Assert.failNotEquals(Assert.java:743)
>       at org.junit.Assert.assertEquals(Assert.java:118)
>       at org.junit.Assert.assertEquals(Assert.java:555)
>       at org.junit.Assert.assertEquals(Assert.java:542)
>       at 
> org.apache.hadoop.yarn.applications.distributedshell.TestDSWithMultipleNodeManager.testDistributedShellWithPlacementConstraint(TestDSWithMultipleNodeManager.java:178)
>       at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>       at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
>       at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>       at java.lang.reflect.Method.invoke(Method.java:498)
>       at 
> org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:47)
>       at 
> org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
>       at 
> org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:44)
>       at 
> org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
>       at 
> org.junit.internal.runners.statements.FailOnTimeout$StatementThread.run(FailOnTimeout.java:74)
> {noformat}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to