Mingzhe Hao created MAPREDUCE-5863:
--------------------------------------
Summary: Killing task attempts while speculation is enabled can
cause the job to fail
Key: MAPREDUCE-5863
URL: https://issues.apache.org/jira/browse/MAPREDUCE-5863
Project: Hadoop Map/Reduce
Issue Type: Bug
Affects Versions: 2.4.0
Reporter: Mingzhe Hao
There could be a race condition when a T_ADD_SPEC_ATTEMPT is being fired, the
task gets succeeded and then killed by the client. In that case, the task state
changes from SUCCEEDED to SCHEDULED, and then task gets a T_ADD_SPEC_ATTEMPT
event, which is invalid for SCHEDULED state.
1. Task is running.
2. Speculator fires a T_ADD_SPEC_ATTEMPT
3. Before task receives T_ADD_SPEC_ATTEMPT, it succeeds
4. Succeeded TA receives TA_KILL from client. Now the task is at SCHEDULED
state.
5. Task receives T_ADD_SPEC_ATTEMPT, since this is an unexpected event, the job
fails.
--
This message was sent by Atlassian JIRA
(v6.2#6252)