[
https://issues.apache.org/jira/browse/MAPREDUCE-5718?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14305036#comment-14305036
]
Yang Hao commented on MAPREDUCE-5718:
-------------------------------------
Hi, I have read the comment twice. It may not be safe to recover under this
situation.
I'm wondering whether you are worring about missing data. But during the
recovery, task output data will be moved to the new output dir, just as the
normal AM failover does, then it's safe to recover
Our cluster will add this feature. Can you give more specific information on
the unsafe situation? Thanks a lot
> MR job will fail after commit fail
> ----------------------------------
>
> Key: MAPREDUCE-5718
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-5718
> Project: Hadoop Map/Reduce
> Issue Type: Bug
> Components: mr-am
> Affects Versions: 2.3.0, 2.6.0
> Reporter: Karthik Kambatla
> Assignee: Yang Hao
> Fix For: 2.6.0
>
> Attachments: MAPREDUCE-5718.v2.patch, mr-5718-0.patch
>
>
> when any of this happens:
> * While testing RM HA, if the RM fails over while an MR AM is in the middle
> of a commit,
> * When testing preempting, if the MR AM fails over during the middle of a
> commit
> the subsequent AM gets spawned but dies with a diagnostic message - "We
> crashed durring a commit".
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)