[ https://issues.apache.org/jira/browse/YARN-6265?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15893101#comment-15893101 ]
Junping Du commented on YARN-6265: ---------------------------------- Agree. IIRC, the later one - state store operation failure cause RM fail fast is this configuration get designed for. cc [~kasha], [~jianhe]. This is to control system level risk for the whole cluster. For app w/o valid queue get submitted, we should have other configuration to identify the expected behavior. > yarn.resourcemanager.fail-fast is used inconsistently > ----------------------------------------------------- > > Key: YARN-6265 > URL: https://issues.apache.org/jira/browse/YARN-6265 > Project: Hadoop YARN > Issue Type: Bug > Components: resourcemanager > Affects Versions: 2.8.0 > Reporter: Daniel Templeton > > In capacity scheduler, the property is used to control whether an app with > no/bad queue should be killed. In the state store, the property controls > whether a state store op failure should cause the RM to exit in non-HA mode. > Those are two very different things, and they should be separated. -- This message was sent by Atlassian JIRA (v6.3.15#6346) --------------------------------------------------------------------- To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org