Benjamin Teke created YARN-11463:
------------------------------------

             Summary: Node Labels root directory creation doesn't have a retry 
logic
                 Key: YARN-11463
                 URL: https://issues.apache.org/jira/browse/YARN-11463
             Project: Hadoop YARN
          Issue Type: Improvement
          Components: capacity scheduler
            Reporter: Benjamin Teke


When CS is initialized, it'll [try to create the configured node labels root 
dir|https://github.com/apache/hadoop/blob/7169ec450957e5602775c3cd6fe1bf0b95773dfb/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-common/src/main/java/org/apache/hadoop/yarn/nodelabels/store/AbstractFSNodeStore.java#L69].
 This however doesn't implement any kind of retry logic (in contrast to the RM 
FS state store or ZK state store), hence if the distributed file system is 
unavailable at the exact moment CS tries to start it'll fail. A retry logic 
could be implemented to improve the robustness of the startup process.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org

Reply via email to