[ 
https://issues.apache.org/jira/browse/YARN-5994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15742822#comment-15742822
 ] 

Eric Badger commented on YARN-5994:
-----------------------------------

This test is racy and so the issue with it only manifests sometimes. YARN-5773 
put in a change so that {{verifyAMLimitForLeafQueue}} registers a 2GB node. 
{{queueA.getAMResourceLimit}} should return 1024 for memory, but is racy and 
almost always gets set to 0 for memory. So when the test submits the app with 
{{amResource2}}, that resource is set to 2048, instead of 3072. In the 
instances that it gets correctly set to 3072, the test errors out that it is 
trying to use more than the max memory available. 

The bandaid fix would be to increase the node size to 4GB, but that doesn't fix 
the race in why {{queueA.getAMResourceLimit}} isn't getting updated before it 
is being accessed in the test. 

[~wangda], [~varun_saxena], [~bibinchundatt], [~rohithsharma], 
[~Naganarasimha], [~sunilg]: Does this analysis make sense? Should we just add 
a {{waitFor}} to wait until {{queueA.getAMResourceLimit}} isn't 0,0? 

> TestCapacityScheduler.testAMLimitUsage fails intermittently
> -----------------------------------------------------------
>
>                 Key: YARN-5994
>                 URL: https://issues.apache.org/jira/browse/YARN-5994
>             Project: Hadoop YARN
>          Issue Type: Bug
>            Reporter: Eric Badger
>            Assignee: Eric Badger
>
> {noformat}
> java.lang.AssertionError: app shouldn't be null
>       at org.junit.Assert.fail(Assert.java:88)
>       at org.junit.Assert.assertTrue(Assert.java:41)
>       at org.junit.Assert.assertNotNull(Assert.java:621)
>       at 
> org.apache.hadoop.yarn.server.resourcemanager.MockRM.waitForState(MockRM.java:169)
>       at 
> org.apache.hadoop.yarn.server.resourcemanager.MockRM.submitApp(MockRM.java:577)
>       at 
> org.apache.hadoop.yarn.server.resourcemanager.MockRM.submitApp(MockRM.java:488)
>       at 
> org.apache.hadoop.yarn.server.resourcemanager.MockRM.submitApp(MockRM.java:395)
>       at 
> org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.TestCapacityScheduler.verifyAMLimitForLeafQueue(TestCapacityScheduler.java:3389)
>       at 
> org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.TestCapacityScheduler.testAMLimitUsage(TestCapacityScheduler.java:3251)
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to