[ https://issues.apache.org/jira/browse/YARN-295?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13683479#comment-13683479 ]
Jason Lowe commented on YARN-295:
---------------------------------
Is it guaranteed that if we get CONTAINER_FINISHED in the ALLOCATED state, we
will subsequently receive LAUNCH_FAILED? I'm not sure that it is.

Consider an AM that crashes instantly, so that LAUNCHED and CONTAINER_FINISHED
are racing. If we ignore the CONTAINER_FINISHED event and later receive
LAUNCHED, the RM will sit around for the AM expiry interval before
rediscovering that the attempt failed and launching another one. The
CONTAINER_FINISHED event is a clear indication that the AM failed to launch.
Wouldn't we want to treat it as such in the ALLOCATED state?
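
To make the race concrete, here is a minimal, self-contained sketch of a
transition table that treats CONTAINER_FINISHED in ALLOCATED as a failed
launch. This is not the RMAppAttemptImpl code or the attached patch: the class
name AttemptStateSketch, the reduced set of states and events, and the next()
helper are all hypothetical and exist only for illustration.

```java
import java.util.EnumMap;
import java.util.Map;

// Hypothetical, simplified model of an app attempt's state machine.
// It does NOT mirror the real RMAppAttemptImpl transition table.
public class AttemptStateSketch {
    enum State { ALLOCATED, LAUNCHED, FAILED }
    enum Event { LAUNCHED, LAUNCH_FAILED, CONTAINER_FINISHED }

    private static final Map<State, Map<Event, State>> TRANSITIONS =
            new EnumMap<>(State.class);

    static {
        Map<Event, State> fromAllocated = new EnumMap<>(Event.class);
        fromAllocated.put(Event.LAUNCHED, State.LAUNCHED);
        fromAllocated.put(Event.LAUNCH_FAILED, State.FAILED);
        // The case under discussion: the AM container finished before the
        // LAUNCHED event was processed, so treat it as a failed launch.
        fromAllocated.put(Event.CONTAINER_FINISHED, State.FAILED);
        TRANSITIONS.put(State.ALLOCATED, fromAllocated);

        Map<Event, State> fromLaunched = new EnumMap<>(Event.class);
        fromLaunched.put(Event.CONTAINER_FINISHED, State.FAILED);
        TRANSITIONS.put(State.LAUNCHED, fromLaunched);

        // Once failed, late events (e.g. a LAUNCHED that lost the race)
        // are absorbed instead of raising an invalid-transition error.
        Map<Event, State> fromFailed = new EnumMap<>(Event.class);
        for (Event e : Event.values()) {
            fromFailed.put(e, State.FAILED);
        }
        TRANSITIONS.put(State.FAILED, fromFailed);
    }

    // Returns the next state, or throws if the event is not registered
    // for the current state.
    static State next(State current, Event event) {
        Map<Event, State> row = TRANSITIONS.get(current);
        State target = (row == null) ? null : row.get(event);
        if (target == null) {
            throw new IllegalStateException(
                    "Invalid event: " + event + " at " + current);
        }
        return target;
    }

    public static void main(String[] args) {
        // CONTAINER_FINISHED wins the race against LAUNCHED.
        State s = next(State.ALLOCATED, Event.CONTAINER_FINISHED);
        System.out.println(s); // FAILED
        // The late LAUNCHED event is then ignored.
        s = next(s, Event.LAUNCHED);
        System.out.println(s); // FAILED
    }
}
```

In this sketch the attempt fails as soon as CONTAINER_FINISHED arrives, so
nothing has to wait for the AM expiry interval, and a LAUNCHED event that
arrives afterwards is harmless.
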
> Resource Manager throws InvalidStateTransitonException: Invalid event:
> CONTAINER_FINISHED at ALLOCATED for RMAppAttemptImpl
> ---------------------------------------------------------------------------------------------------------------------------
>
> Key: YARN-295
> URL: https://issues.apache.org/jira/browse/YARN-295
> Project: Hadoop YARN
> Issue Type: Sub-task
> Components: resourcemanager
> Affects Versions: 2.0.2-alpha, 2.0.1-alpha
> Reporter: Devaraj K
> Assignee: Mayank Bansal
> Attachments: YARN-295-trunk-1.patch
>
>
> {code:xml}
> 2012-12-28 14:03:56,956 ERROR org.apache.hadoop.yarn.server.resourcemanager.rmapp.attempt.RMAppAttemptImpl: Can't handle this event at current state
> org.apache.hadoop.yarn.state.InvalidStateTransitonException: Invalid event: CONTAINER_FINISHED at ALLOCATED
>     at org.apache.hadoop.yarn.state.StateMachineFactory.doTransition(StateMachineFactory.java:301)
>     at org.apache.hadoop.yarn.state.StateMachineFactory.access$300(StateMachineFactory.java:43)
>     at org.apache.hadoop.yarn.state.StateMachineFactory$InternalStateMachine.doTransition(StateMachineFactory.java:443)
>     at org.apache.hadoop.yarn.server.resourcemanager.rmapp.attempt.RMAppAttemptImpl.handle(RMAppAttemptImpl.java:490)
>     at org.apache.hadoop.yarn.server.resourcemanager.rmapp.attempt.RMAppAttemptImpl.handle(RMAppAttemptImpl.java:80)
>     at org.apache.hadoop.yarn.server.resourcemanager.ResourceManager$ApplicationAttemptEventDispatcher.handle(ResourceManager.java:433)
>     at org.apache.hadoop.yarn.server.resourcemanager.ResourceManager$ApplicationAttemptEventDispatcher.handle(ResourceManager.java:414)
>     at org.apache.hadoop.yarn.event.AsyncDispatcher.dispatch(AsyncDispatcher.java:126)
>     at org.apache.hadoop.yarn.event.AsyncDispatcher$1.run(AsyncDispatcher.java:75)
>     at java.lang.Thread.run(Thread.java:662)
> {code}