[
https://issues.apache.org/jira/browse/MAPREDUCE-2129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12920183#action_12920183
]
Kang Xiao commented on MAPREDUCE-2129:
--------------------------------------
Here is an example:
* The job has 100 maps and no reduce
* mapreduce.job.committer.setup.cleanup.needed=false
* mapreduce.map/reduce.failures.maxpercent=5, so at most 5 map tip is allowed
to fail
* 99 maps successed
* the last map failed 4 attempts, then the last TIP failed
* the failed TIP will not cause the job to fail since 1 < 5
* no cleanup task will be lanuched since
mapreduce.job.committer.setup.cleanup.needed=false
* jobComplete() at the tail of completedTask() has no chance to be invoked, so
job hangs at RUNNING state
> Job may hang if mapreduce.job.committer.setup.cleanup.needed=true and
> mapreduce.map/reduce.failures.maxpercent>0
> ----------------------------------------------------------------------------------------------------------------
>
> Key: MAPREDUCE-2129
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2129
> Project: Hadoop Map/Reduce
> Issue Type: Bug
> Components: jobtracker
> Affects Versions: 0.20.1, 0.20.2, 0.20.3, 0.21.0, 0.21.1, 0.22.0
> Reporter: Kang Xiao
>
> Job may hang at RUNNING state if
> mapreduce.job.committer.setup.cleanup.needed=true and
> mapreduce.map/reduce.failures.maxpercent>0. It happens when some tasks fail
> but havent reached failures.maxpercent.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.