[ 
https://issues.apache.org/jira/browse/YARN-1055?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Karthik Kambatla reopened YARN-1055:
------------------------------------


Here is one use case where setting app retries alone might not be enough.

Consider jobs being run via Oozie. We want different behavior for the launched 
actions in the following two scenarios:
# Without RM restart, we want to be able to set app-retries > 1 to address any 
transient errors.
# On RM restart, we want Oozie launchers to be recovered, but not the already 
launched actions. Otherwise, we will end up having two copies of the same 
action.

Thoughts? 
                
> App recovery should be configurable per application
> ---------------------------------------------------
>
>                 Key: YARN-1055
>                 URL: https://issues.apache.org/jira/browse/YARN-1055
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>          Components: resourcemanager
>    Affects Versions: 2.1.0-beta
>            Reporter: Karthik Kambatla
>
> In Hadoop-1, the job recovery on JT restart is configurable per-job. For 
> parity and its usefulness, we should have the same behavior in YARN as well.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to