[ https://issues.apache.org/jira/browse/MAPREDUCE-5956?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14107023#comment-14107023 ]
Jason Lowe commented on MAPREDUCE-5956: --------------------------------------- When it comes to leaking staging directories, there are far more common cases where that occurs than this scenario. e.g.: application killed before AM starts or in-between AM retries, AM is misconfigured and fails every time, etc. It seems like the scenario we're worried about is highly unlikely, so I don't think it'd be a big deal to put into 2.5.1 from that standpoint. IIRC the problem being fixed here isn't an issue unless preemption is a factor. If we're officially supporting preemption in 2.5 then I think this is a good candidate to consider for 2.5.1, otherwise it can wait until 2.6. > MapReduce AM should not use maxAttempts to determine if this is the last retry > ------------------------------------------------------------------------------ > > Key: MAPREDUCE-5956 > URL: https://issues.apache.org/jira/browse/MAPREDUCE-5956 > Project: Hadoop Map/Reduce > Issue Type: Sub-task > Components: applicationmaster, mrv2 > Affects Versions: 2.4.0 > Reporter: Vinod Kumar Vavilapalli > Assignee: Wangda Tan > Priority: Blocker > Fix For: 2.6.0 > > Attachments: MR-5956.patch, MR-5956.patch > > > Found this while reviewing YARN-2074. The problem is that after YARN-2074, we > don't count AM preemption towards AM failures on RM side, but MapReduce AM > itself checks the attempt id against the max-attempt count to determine if > this is the last attempt. > {code} > public void computeIsLastAMRetry() { > isLastAMRetry = appAttemptID.getAttemptId() >= maxAppAttempts; > } > {code} > This causes issues w.r.t deletion of staging directory etc.. -- This message was sent by Atlassian JIRA (v6.2#6252)