[ 
https://issues.apache.org/jira/browse/YARN-542?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13628019#comment-13628019
 ] 

Zhijie Shen commented on YARN-542:
----------------------------------

@Vinod , how about conservatively changing max attempts to 2, if there's no 
strong motivation to give AM multiple retry chances?
                
> Change the default AM retry value to be not one
> -----------------------------------------------
>
>                 Key: YARN-542
>                 URL: https://issues.apache.org/jira/browse/YARN-542
>             Project: Hadoop YARN
>          Issue Type: Bug
>            Reporter: Vinod Kumar Vavilapalli
>            Assignee: Vinod Kumar Vavilapalli
>
> Today, the AM max-retries is set to 1 which is a bad choice. AM max-retries 
> accounts for both AM level failures as well as container crashes due to 
> localization issue, lost nodes etc. To account for AM crashes due to problems 
> that are not caused by user code, mainly lost nodes, we want to give AMs some 
> retires.
> I propose we change it to atleast two. Can change it to 4 to match other 
> retry-configs.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to