[ 
https://issues.apache.org/jira/browse/YARN-6265?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15893101#comment-15893101
 ] 

Junping Du commented on YARN-6265:
----------------------------------

Agreed. IIRC, the latter case (a state store operation failure causing the RM 
to fail fast) is what this configuration was designed for. cc [~kasha], 
[~jianhe]. It is meant to control system-level risk for the whole cluster. For 
an app submitted without a valid queue, we should have a separate configuration 
that specifies the expected behavior.
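
A minimal sketch of what such a split could look like, not a patch. It uses 
plain java.util.Properties rather than Hadoop's Configuration so that it stays 
self-contained. Only yarn.resourcemanager.fail-fast comes from this issue; the 
second property name, the helper names, and the default of false are all 
assumptions made up for illustration.

```java
import java.util.Properties;

public class FailFastSketch {
    // Property named in this issue.
    static final String RM_FAIL_FAST = "yarn.resourcemanager.fail-fast";

    // HYPOTHETICAL property name, invented for this sketch only.
    static final String KILL_APP_ON_INVALID_QUEUE =
        "example.scheduler.kill-app-on-invalid-queue";

    // Reads a boolean property, falling back to the given default when unset.
    static boolean getBoolean(Properties conf, String key, boolean def) {
        String value = conf.getProperty(key);
        return value == null ? def : Boolean.parseBoolean(value.trim());
    }

    // Cluster-level risk: should a state store op failure make the RM exit?
    // Models the non-HA case only; the default of false is an assumption.
    static boolean exitOnStateStoreFailure(Properties conf) {
        return getBoolean(conf, RM_FAIL_FAST, false);
    }

    // App-level behavior: should an app with no/bad queue be killed?
    // Reads its own (hypothetical) key, so it no longer follows RM_FAIL_FAST.
    static boolean killAppOnInvalidQueue(Properties conf) {
        return getBoolean(conf, KILL_APP_ON_INVALID_QUEUE, false);
    }

    public static void main(String[] args) {
        Properties conf = new Properties();
        conf.setProperty(RM_FAIL_FAST, "true");
        System.out.println("exit on state store failure: "
            + exitOnStateStoreFailure(conf));
        System.out.println("kill app on invalid queue: "
            + killAppOnInvalidQueue(conf));
    }
}
```

With separate keys, turning on fail-fast for state store failures leaves the 
handling of apps with an invalid queue unchanged, and vice versa.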

> yarn.resourcemanager.fail-fast is used inconsistently
> -----------------------------------------------------
>
>                 Key: YARN-6265
>                 URL: https://issues.apache.org/jira/browse/YARN-6265
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: resourcemanager
>    Affects Versions: 2.8.0
>            Reporter: Daniel Templeton
>
> In capacity scheduler, the property is used to control whether an app with 
> no/bad queue should be killed.  In the state store, the property controls 
> whether a state store op failure should cause the RM to exit in non-HA mode.  
> Those are two very different things, and they should be separated.



