[ 
https://issues.apache.org/jira/browse/MAPREDUCE-5718?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14305036#comment-14305036
 ] 

Yang Hao commented on MAPREDUCE-5718:
-------------------------------------

Hi, I have read the comment twice, including the concern that it may not be 
safe to recover in this situation. 

I'm wondering whether you are worried about missing data. During recovery, 
however, task output data is moved to the new output dir, just as in a 
normal AM failover, so it should be safe to recover.
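
To make the point above concrete, here is a rough sketch of what "moving task 
output to the new output dir" could look like. It is only an illustration and 
not the actual committer code: the class name, the directory layout 
(<jobOutput>/_temporary/<appAttemptId>/<taskId>) and both method names are 
assumptions made up for this example, and it uses the local filesystem rather 
than HDFS.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;

public class RecoverySketch {

    // Hypothetical layout: <jobOutput>/_temporary/<appAttemptId>/<taskId>
    static Path committedTaskDir(Path jobOutput, int appAttemptId, String taskId) {
        return jobOutput.resolve("_temporary")
                        .resolve(Integer.toString(appAttemptId))
                        .resolve(taskId);
    }

    // Moves the output a task committed under the previous AM attempt into
    // the directory of the current AM attempt. Returns false when nothing
    // was committed, in which case the task would have to be rerun.
    static boolean recoverTask(Path jobOutput, int previousAttempt,
                               int currentAttempt, String taskId) throws IOException {
        Path from = committedTaskDir(jobOutput, previousAttempt, taskId);
        if (!Files.isDirectory(from)) {
            return false;
        }
        Path to = committedTaskDir(jobOutput, currentAttempt, taskId);
        Files.createDirectories(to.getParent());
        // On a single filesystem this is a rename of the whole directory.
        Files.move(from, to);
        return true;
    }
}
```

With this layout, output from a task that finished under attempt 1 ends up 
under attempt 2's directory, so the new AM does not need to rerun that task.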

Our cluster will add this feature. Could you give more specific information 
on the unsafe situation? Thanks a lot.

> MR job will fail after commit fail
> ----------------------------------
>
>                 Key: MAPREDUCE-5718
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5718
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: mr-am
>    Affects Versions: 2.3.0, 2.6.0
>            Reporter: Karthik Kambatla
>            Assignee: Yang Hao
>             Fix For: 2.6.0
>
>         Attachments: MAPREDUCE-5718.v2.patch, mr-5718-0.patch
>
>
> When any of the following happens:
> * While testing RM HA, the RM fails over while an MR AM is in the middle 
> of a commit
> * While testing preemption, the MR AM fails over in the middle of a 
> commit
> the subsequent AM gets spawned but dies with a diagnostic message - "We 
> crashed durring a commit". 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
