[ https://issues.apache.org/jira/browse/YARN-1055?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Karthik Kambatla reopened YARN-1055: ------------------------------------ Here is one use case where setting app retries alone might not be enough. Consider jobs being run via Oozie. We want different behavior for the launched actions in the following two scenarios: # Without RM restart, we want to be able to set app-retries > 1 to address any transient errors. # On RM restart, we want Oozie launchers to be recovered, but not the already launched actions. Otherwise, we will end up having two copies of the same action. Thoughts? > App recovery should be configurable per application > --------------------------------------------------- > > Key: YARN-1055 > URL: https://issues.apache.org/jira/browse/YARN-1055 > Project: Hadoop YARN > Issue Type: Sub-task > Components: resourcemanager > Affects Versions: 2.1.0-beta > Reporter: Karthik Kambatla > > In Hadoop-1, the job recovery on JT restart is configurable per-job. For > parity and its usefulness, we should have the same behavior in YARN as well. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira