[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12920183#action_12920183
 ] 

Kang Xiao commented on MAPREDUCE-2129:
--------------------------------------

Here is an example:

* The job has 100 maps and no reduce
* mapreduce.job.committer.setup.cleanup.needed=false
* mapreduce.map/reduce.failures.maxpercent=5, so at most 5 map tip is allowed 
to fail
* 99 maps successed
* the last map failed 4 attempts, then the last TIP failed
* the failed TIP will not cause the job to fail since 1 < 5
* no cleanup task will be lanuched since 
mapreduce.job.committer.setup.cleanup.needed=false
* jobComplete() at the tail of completedTask() has no chance to be invoked, so 
job hangs at RUNNING state


> Job may hang if mapreduce.job.committer.setup.cleanup.needed=true and 
> mapreduce.map/reduce.failures.maxpercent>0
> ----------------------------------------------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-2129
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2129
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: jobtracker
>    Affects Versions: 0.20.1, 0.20.2, 0.20.3, 0.21.0, 0.21.1, 0.22.0
>            Reporter: Kang Xiao
>
> Job may hang at RUNNING state if 
> mapreduce.job.committer.setup.cleanup.needed=true and 
> mapreduce.map/reduce.failures.maxpercent>0. It happens when some tasks fail 
> but havent reached failures.maxpercent.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to