[
https://issues.apache.org/jira/browse/YARN-1816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13932105#comment-13932105
]
Jian He commented on YARN-1816:
-------------------------------
The log shows on recovery, Attempt shows finished, but application stucks at
accepted state.
The reason is RMApp fails to handle the AttemptFinished event, when attempt is
recovering and send the AttemptFinished event back to RMApp.
> Succeeded application remains in accepted
> -----------------------------------------
>
> Key: YARN-1816
> URL: https://issues.apache.org/jira/browse/YARN-1816
> Project: Hadoop YARN
> Issue Type: Bug
> Affects Versions: 2.4.0
> Reporter: Arpit Gupta
> Assignee: Jian He
> Attachments: YARN-1816.1.patch
>
>
> {code}
> 2014-03-10 18:07:31,944|beaver.machine|INFO|Application-Id
> Application-Name Application-Type User Queue
> State Final-State Progress
> Tracking-URL
> 2014-03-10 18:07:31,945|beaver.machine|INFO|application_1394449508064_0008
> test_mapred_ha_multiple_job_nn-rm-1-min-5-jobs_1394449960-4
> MAPREDUCE hrt_qa default ACCEPTED
> SUCCEEDED 100%
> http://hostname:19888/jobhistory/job/job_1394449508064_0008
> 2014-03-10 18:08:02,125|beaver.machine|INFO|RUNNING: /usr/bin/yarn
> application -list -appStates NEW,NEW_SAVING,SUBMITTED,ACCEPTED,RUNNING
> 2014-03-10 18:08:03,198|beaver.machine|INFO|14/03/10 18:08:03 INFO
> client.ConfiguredRMFailoverProxyProvider: Failing over to rm2
> 2014-03-10 18:08:03,238|beaver.machine|INFO|Total number of applications
> (application-types: [] and states: [NEW, NEW_SAVING, SUBMITTED, ACCEPTED,
> RUNNING]):1
> 2014-03-10 18:08:03,239|beaver.machine|INFO|Application-Id
> Application-Name Application-Type User Queue
> State Final-State Progress
> Tracking-URL
> 2014-03-10 18:08:03,239|beaver.machine|INFO|application_1394449508064_0008
> test_mapred_ha_multiple_job_nn-rm-1-min-5-jobs_1394449960-4
> MAPREDUCE hrt_qa default ACCEPTED
> SUCCEEDED 100%
> http://hostname:19888/jobhistory/job/job_1394449508064_0008
> 2014-03-10 18:08:33,390|beaver.machine|INFO|RUNNING: /usr/bin/yarn
> application -list -appStates NEW,NEW_SAVING,SUBMITTED,ACCEPTED,RUNNING
> 2014-03-10 18:08:34,437|beaver.machine|INFO|14/03/10 18:08:34 INFO
> client.ConfiguredRMFailoverProxyProvider: Failing over to rm2
> 2014-03-10 18:08:34,477|beaver.machine|INFO|Total number of applications
> (application-types: [] and states: [NEW, NEW_SAVING, SUBMITTED, ACCEPTED,
> RUNNING]):1
> 2014-03-10 18:08:34,477|beaver.machine|INFO|Application-Id
> Application-Name Application-Type User Queue
> State Final-State Progress
> Tracking-URL
> 2014-03-10 18:08:34,478|beaver.machine|INFO|application_1394449508064_0008
> test_mapred_ha_multiple_job_nn-rm-1-min-5-jobs_1394449960-4
> MAPREDUCE hrt_qa default ACCEPTED
> SUCCEEDED 100%
> http://hostname:19888/jobhistory/job/job_1394449508064_0008
> 2014-03-10 18:09:04,628|beaver.machine|INFO|RUNNING: /usr/bin/yarn
> application -list -appStates NEW,NEW_SAVING,SUBMITTED,ACCEPTED,RUNNING
> 2014-03-10 18:09:05,688|beaver.machine|INFO|14/03/10 18:09:05 INFO
> client.ConfiguredRMFailoverProxyProvider: Failing over to rm2
> 2014-03-10 18:09:05,728|beaver.machine|INFO|Total number of applications
> (application-types: [] and states: [NEW, NEW_SAVING, SUBMITTED, ACCEPTED,
> RUNNING]):1
> 2014-03-10 18:09:05,728|beaver.machine|INFO|Application-Id
> Application-Name Application-Type User Queue
> State Final-State Progress
> Tracking-URL
> 2014-03-10 18:09:05,729|beaver.machine|INFO|application_1394449508064_0008
> test_mapred_ha_multiple_job_nn-rm-1-min-5-jobs_1394449960-4
> MAPREDUCE hrt_qa default ACCEPTED
> SUCCEEDED 100%
> http://hostname:19888/jobhistory/job/job_1394449508064_0008
> 2014-03-10 18:09:35,879|beaver.machine|INFO|RUNNING: /usr/bin/yarn
> application -list -appStates NEW,NEW_SAVING,SUBMITTED,ACCEPTED,RUNNING
> 2014-03-10 18:09:36,951|beaver.machine|INFO|14/03/10 18:09:36 INFO
> client.ConfiguredRMFailoverProxyProvider: Failing over to rm2
> 2014-03-10 18:09:36,992|beaver.machine|INFO|Total number of applications
> (application-types: [] and states: [NEW, NEW_SAVING, SUBMITTED, ACCEPTED,
> RUNNING]):1
> 2014-03-10 18:09:36,993|beaver.machine|INFO|Application-Id
> Application-Name Application-Type User Queue
> State Final-State Progress
> Tracking-URL
> 2014-03-10 18:09:36,993|beaver.machine|INFO|application_1394449508064_0008
> test_mapred_ha_multiple_job_nn-rm-1-min-5-jobs_1394449960-4
> MAPREDUCE hrt_qa default ACCEPTED
> SUCCEEDED 100%
> http://hostname:19888/jobhistory/job/job_1394449508064_0008
> 2014-03-10 18:10:07,142|beaver.machine|INFO|RUNNING: /usr/bin/yarn
> application -list -appStates NEW,NEW_SAVING,SUBMITTED,ACCEPTED,RUNNING
> 2014-03-10 18:10:08,201|beaver.machine|INFO|14/03/10 18:10:08 INFO
> client.ConfiguredRMFailoverProxyProvider: Failing over to rm2
> 2014-03-10 18:10:08,242|beaver.machine|INFO|Total number of applications
> (application-types: [] and states: [NEW, NEW_SAVING, SUBMITTED, ACCEPTED,
> RUNNING]):1
> 2014-03-10 18:10:08,242|beaver.machine|INFO|Application-Id
> Application-Name Application-Type User Queue
> State Final-State Progress
> Tracking-URL
> 2014-03-10 18:10:08,242|beaver.machine|INFO|application_1394449508064_0008
> test_mapred_ha_multiple_job_nn-rm-1-min-5-jobs_1394449960-4
> MAPREDUCE hrt_qa default ACCEPTED
> SUCCEEDED 100%
> http://hostname:19888/jobhistory/job/job_1394449508064_0008
> 2014-03-10 18:10:38,392|beaver.machine|INFO|RUNNING: /usr/bin/yarn
> application -list -appStates NEW,NEW_SAVING,SUBMITTED,ACCEPTED,RUNNING
> 2014-03-10 18:10:39,443|beaver.machine|INFO|14/03/10 18:10:39 INFO
> client.ConfiguredRMFailoverProxyProvider: Failing over to rm2
> 2014-03-10 18:10:39,484|beaver.machine|INFO|Total number of applications
> (application-types: [] and states: [NEW, NEW_SAVING, SUBMITTED, ACCEPTED,
> RUNNING]):1
> 2014-03-10 18:10:39,484|beaver.machine|INFO|Application-Id
> Application-Name Application-Type User Queue
> State Final-State Progress
> Tracking-URL
> 2014-03-10 18:10:39,485|beaver.machine|INFO|application_1394449508064_0008
> test_mapred_ha_multiple_job_nn-rm-1-min-5-jobs_1394449960-4
> MAPREDUCE hrt_qa default ACCEPTED
> SUCCEEDED 100%
> http://hostname:19888/jobhistory/job/job_1394449508064_0008
> {code}
--
This message was sent by Atlassian JIRA
(v6.2#6252)