[ https://issues.apache.org/jira/browse/MAPREDUCE-7278?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17096650#comment-17096650 ]
Tarun Parimi commented on MAPREDUCE-7278:
-----------------------------------------

Multiple task attempts are launched simultaneously after a single attempt failure. This is because {{TaskAttemptImpl#notifyTaskAttemptFailed()}} is called twice, when the TaskAttempt transitions from:
1. RUNNING -> FAIL_FINISHING_CONTAINER
2. FAIL_CONTAINER_CLEANUP -> FAIL_TASK_CLEANUP -> FAILED

However, this issue is seen only after the MAPREDUCE-6485 code change, since that change allows new task attempts to launch even if there are existing task attempts for the task. It checks whether a container is assigned, but in a fully utilized cluster container assignment can take a few minutes, so two task attempts can be launched when the above transitions occur.

> Speculative execution behavior is observed even when mapreduce.map.speculative and mapreduce.reduce.speculative are false
> -------------------------------------------------------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-7278
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-7278
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: task
>    Affects Versions: 2.8.0
>            Reporter: Tarun Parimi
>            Priority: Major
>         Attachments: Screen Shot 2020-04-30 at 8.04.27 PM.png
>
>
> When a failed task attempt container is stuck in the FAIL_FINISHING_CONTAINER
> state for some time, we observe that two task attempts are launched simultaneously
> even when speculative execution is disabled.
> This results in the message below being shown in the killed attempts, indicating
> that speculation has occurred. This is an issue for jobs which require speculative
> execution to be strictly disabled.
> !Screen Shot 2020-04-30 at 8.04.27 PM.png!
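
The double notification described in the comment can be illustrated with a small toy model. This is only a sketch under stated assumptions, not the actual Hadoop code: the class {{DuplicateRetrySketch}}, its nested {{Task}} and {{Attempt}} types, and the method {{retriesAfterOneFailure}} are all hypothetical names, and the retry rule (skip the retry only when a non-failed attempt already holds a container) is a simplified reading of the container-assignment check mentioned above.

```java
import java.util.ArrayList;
import java.util.List;

/** Toy model of the double failure notification; NOT the real Hadoop classes. */
public class DuplicateRetrySketch {

    /** Hypothetical attempt: tracks only failure and container assignment. */
    static final class Attempt {
        boolean failed;
        boolean containerAssigned;
    }

    /** Hypothetical task that schedules a retry on each failure notification. */
    static final class Task {
        final List<Attempt> attempts = new ArrayList<>();

        Attempt launchAttempt() {
            Attempt a = new Attempt();
            attempts.add(a);
            return a;
        }

        // Assumed retry rule: skip the retry only if some non-failed
        // attempt already has a container assigned.
        void onAttemptFailed(Attempt failedAttempt) {
            failedAttempt.failed = true;
            for (Attempt a : attempts) {
                if (!a.failed && a.containerAssigned) {
                    return;
                }
            }
            launchAttempt();
        }
    }

    /** Returns how many retry attempts one failed attempt produces. */
    public static int retriesAfterOneFailure(boolean containerAssignedBetweenNotifications) {
        Task task = new Task();
        Attempt first = task.launchAttempt();
        first.containerAssigned = true;

        // First notification: RUNNING -> FAIL_FINISHING_CONTAINER
        task.onAttemptFailed(first);

        // On an idle cluster the retry gets its container right away;
        // on a fully utilized cluster it is still waiting.
        if (containerAssignedBetweenNotifications) {
            task.attempts.get(1).containerAssigned = true;
        }

        // Second notification: FAIL_CONTAINER_CLEANUP -> FAIL_TASK_CLEANUP -> FAILED
        task.onAttemptFailed(first);

        return task.attempts.size() - 1;
    }

    public static void main(String[] args) {
        System.out.println("busy cluster retries: " + retriesAfterOneFailure(false));
        System.out.println("idle cluster retries: " + retriesAfterOneFailure(true));
    }
}
```

In this model the second notification is harmless when the first retry already holds a container, and produces a second concurrent retry when the retry is still waiting for one, which matches the behavior reported on a fully utilized cluster.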