[
https://issues.apache.org/jira/browse/MAPREDUCE-7222?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16876091#comment-16876091
]
yanglongfei commented on MAPREDUCE-7222:
----------------------------------------
[~anupamseth] Could you help to give some advice to the problem? I found
related code is
job.scheduleTasks(job.mapTasks, job.numReduceTasks == 0);
job.scheduleTasks(job.reduceTasks, true);
> Map tasks' outputs can not be recovered when ApplicationMaster relaunched
> --------------------------------------------------------------------------
>
> Key: MAPREDUCE-7222
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-7222
> Project: Hadoop Map/Reduce
> Issue Type: Bug
> Components: mrv2
> Affects Versions: 2.7.3
> Reporter: yanglongfei
> Priority: Blocker
>
> When AM crashes, Yarn would launch a new AM instance and recover all its
> scheduled tasks. However mapper tasks's committed output files are not
> recovered when the number of reducers > 0. In my application which output
> files from mapper and make use of reducer to collect statistics not able to
> fully recover from the AM crash, and resulting in data from the previous
> completed mapper tasks get lost in the final output dir.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]