[ https://issues.apache.org/jira/browse/YARN-5416?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15395930#comment-15395930 ]
Junping Du commented on YARN-5416:
----------------------------------
bq. This looks like an exact dup of YARN-1468 which you also filed. Are they
actually different?
Oh, no. YARN-1468 is a very old JIRA that had fallen off my radar for some
reason (I didn't notice the recent comments from Eric there). I think we can
close this as a dup of that. What do you think?
bq. Junping Du, is there any reason why we would only add the
waitSchedulerApplicationAttemptStopped call for the first app attempt, but not
for the subsequent ones?
Hi Eric, this just follows the pattern we applied in YARN-4968, where the wait
seems necessary only when another AM is launched immediately afterwards - that
is exactly where the exception happens. Do you think there are other places
where we should add it?
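To illustrate the pattern being discussed (wait until the previous scheduler attempt has stopped, then launch the next AM), here is a minimal, self-contained sketch of a poll-with-timeout helper. It is not the code from the patch or from MockRM: the class name StopWaiter, the method waitUntil, and its parameters are hypothetical, and the condition passed in stands in for whatever "attempt is stopped" check the test actually uses.

```java
import java.util.function.BooleanSupplier;

// Hypothetical helper for illustration only; not part of Hadoop or the patch.
final class StopWaiter {
    private StopWaiter() {}

    // Polls the condition every pollMs milliseconds until it holds or
    // timeoutMs elapses. Returns true if the condition became true,
    // false on timeout or if the waiting thread was interrupted.
    static boolean waitUntil(BooleanSupplier condition, long timeoutMs, long pollMs) {
        long deadline = System.nanoTime() + timeoutMs * 1_000_000L;
        while (!condition.getAsBoolean()) {
            if (System.nanoTime() - deadline >= 0) {
                return false; // timed out before the condition held
            }
            try {
                Thread.sleep(pollMs);
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt(); // preserve interrupt status
                return false;
            }
        }
        return true;
    }
}
```

A test would call something like StopWaiter.waitUntil(attemptStopped, 10000, 100) before launching the next AM, and fail fast if it returns false, instead of racing against the scheduler.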
> TestRMRestart#testRMRestartWaitForPreviousAMToFinish failed intermittently
> due to not wait SchedulerApplicationAttempt to be stopped
> ------------------------------------------------------------------------------------------------------------------------------------
>
> Key: YARN-5416
> URL: https://issues.apache.org/jira/browse/YARN-5416
> Project: Hadoop YARN
> Issue Type: Sub-task
> Components: test, yarn
> Reporter: Junping Du
> Assignee: Junping Du
> Priority: Minor
> Attachments: YARN-5416.patch
>
>
> The test failure stack is:
> Running org.apache.hadoop.yarn.server.resourcemanager.TestRMRestart
> Tests run: 54, Failures: 1, Errors: 0, Skipped: 0, Time elapsed: 385.338 sec <<< FAILURE! - in org.apache.hadoop.yarn.server.resourcemanager.TestRMRestart
> testRMRestartWaitForPreviousAMToFinish[0](org.apache.hadoop.yarn.server.resourcemanager.TestRMRestart) Time elapsed: 43.134 sec <<< FAILURE!
> java.lang.AssertionError: AppAttempt state is not correct (timedout) expected:<ALLOCATED> but was:<SCHEDULED>
> at org.junit.Assert.fail(Assert.java:88)
> at org.junit.Assert.failNotEquals(Assert.java:743)
> at org.junit.Assert.assertEquals(Assert.java:118)
> at org.apache.hadoop.yarn.server.resourcemanager.MockAM.waitForState(MockAM.java:86)
> at org.apache.hadoop.yarn.server.resourcemanager.MockRM.sendAMLaunched(MockRM.java:594)
> at org.apache.hadoop.yarn.server.resourcemanager.TestRMRestart.launchAM(TestRMRestart.java:1008)
> at org.apache.hadoop.yarn.server.resourcemanager.TestRMRestart.testRMRestartWaitForPreviousAMToFinish(TestRMRestart.java:530)
> This is due to the same issue that was partially fixed in YARN-4968.