[ 
https://issues.apache.org/jira/browse/MAPREDUCE-7278?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17096650#comment-17096650
 ] 

Tarun Parimi commented on MAPREDUCE-7278:
-----------------------------------------

Mutiple task attempts are tried simultaneously for a single attempt failure. 
This is because {{TaskAttemptImpl#notifyTaskAttemptFailed()}} is called twice, 
when TaskAttempt transitions from,
1. RUNNING -> FAIL_FINISHING_CONTAINER
2. FAIL_CONTAINER_CLEANUP -> FAIL_TASK_CLEANUP  -> FAILED

But this issue is seen only after the MAPREDUCE-6485 code change, since it 
allows multiple new taskattempts to launch even if there are existing 
taskattempts for that task. It checks whether a container is assigned, but in a 
fully utilized cluster container assignment can take a few minutes and so we 
can have two taskattempts launched when the above transitions occur.







> Speculative execution behavior is observed even when 
> mapreduce.map.speculative and mapreduce.reduce.speculative are false
> -------------------------------------------------------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-7278
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-7278
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: task
>    Affects Versions: 2.8.0
>            Reporter: Tarun Parimi
>            Priority: Major
>         Attachments: Screen Shot 2020-04-30 at 8.04.27 PM.png
>
>
> When a failed task attempt container is stuck in FAIL_FINISHING_CONTAINER 
> state for some time, we observe two task attempts are launched simultaneously 
> even when speculative execution is disabled.
> This results in the below message shown in the killed attempts, indicating 
> speculation has occurred. This is an issue for jobs which require speculative 
> execution to be strictly disabled.
>   !Screen Shot 2020-04-30 at 8.04.27 PM.png!
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: mapreduce-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: mapreduce-issues-h...@hadoop.apache.org

Reply via email to