[
https://issues.apache.org/jira/browse/YARN-6265?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15893101#comment-15893101
]
Junping Du commented on YARN-6265:
----------------------------------
Agreed. IIRC, the latter case (a state store operation failure causing the RM
to fail fast) is what this configuration was designed for. cc [~kasha],
[~jianhe]. It is meant to control system-level risk for the whole cluster. For
an app submitted without a valid queue, we should have a separate configuration
that defines the expected behavior.
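
A minimal sketch of what that separation might look like, assuming two
independent boolean properties. Only yarn.resourcemanager.fail-fast comes from
this issue; the second property name, the class, the method names, and the
"false" defaults are hypothetical and exist only for illustration. Plain
java.util.Properties stands in for the Hadoop configuration object so the
example runs on its own.

```java
import java.util.Properties;

public class FailFastSketch {
    // Property named in this issue.
    static final String RM_FAIL_FAST = "yarn.resourcemanager.fail-fast";

    // Hypothetical property, invented for this sketch; not a real YARN key.
    static final String INVALID_QUEUE_FAIL_FAST =
        "example.scheduler.invalid-queue.fail-fast";

    // Should a state store operation failure make the RM exit?
    // The "false" default is an assumption of this sketch.
    static boolean exitOnStateStoreFailure(Properties conf) {
        return Boolean.parseBoolean(conf.getProperty(RM_FAIL_FAST, "false"));
    }

    // Should an app submitted with no/bad queue trigger fail-fast handling?
    // Reads only the hypothetical key, so it is independent of RM_FAIL_FAST.
    static boolean failFastOnInvalidQueue(Properties conf) {
        return Boolean.parseBoolean(
            conf.getProperty(INVALID_QUEUE_FAIL_FAST, "false"));
    }

    public static void main(String[] args) {
        Properties conf = new Properties();
        conf.setProperty(RM_FAIL_FAST, "true");

        // Only the state store decision is affected by RM_FAIL_FAST.
        System.out.println("state store: " + exitOnStateStoreFailure(conf));
        System.out.println("invalid queue: " + failFastOnInvalidQueue(conf));
    }
}
```

With separate keys, an operator could keep the cluster-level safeguard enabled
without changing how a bad queue submission is handled, and vice versa.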
> yarn.resourcemanager.fail-fast is used inconsistently
> -----------------------------------------------------
>
> Key: YARN-6265
> URL: https://issues.apache.org/jira/browse/YARN-6265
> Project: Hadoop YARN
> Issue Type: Bug
> Components: resourcemanager
> Affects Versions: 2.8.0
> Reporter: Daniel Templeton
>
> In capacity scheduler, the property is used to control whether an app with
> no/bad queue should be killed. In the state store, the property controls
> whether a state store op failure should cause the RM to exit in non-HA mode.
> Those are two very different things, and they should be separated.