NM does not communicate Container crash to RM
---------------------------------------------
Key: MAPREDUCE-2875
URL: https://issues.apache.org/jira/browse/MAPREDUCE-2875
Project: Hadoop Map/Reduce
Issue Type: Bug
Reporter: Sharad Agarwal
Fix For: 0.23.0
Faulty container crash detection code path in NodeManager.
Steps:
Run a job.
Kill the AM container in NM.
NM logs has:
org.apache.hadoop.yarn.state.InvalidStateTransitonException: Invalid event:
CONTAINER_KILLED_ON_REQUEST at RUNNING
at
org.apache.hadoop.yarn.state.StateMachineFactory.doTransition(StateMachineFactory.java:297)
at
org.apache.hadoop.yarn.state.StateMachineFactory.access$300(StateMachineFactory.java:39)
at
org.apache.hadoop.yarn.state.StateMachineFactory$InternalStateMachine.doTransition(StateMachineFactory.java:439)
at
org.apache.hadoop.yarn.server.nodemanager.containermanager.container.ContainerImpl.handle(ContainerImpl.java:685)
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira