Benjamin Teke created YARN-11463: ------------------------------------ Summary: Node Labels root directory creation doesn't have a retry logic Key: YARN-11463 URL: https://issues.apache.org/jira/browse/YARN-11463 Project: Hadoop YARN Issue Type: Improvement Components: capacity scheduler Reporter: Benjamin Teke
When CS is initialized, it'll [try to create the configured node labels root dir|https://github.com/apache/hadoop/blob/7169ec450957e5602775c3cd6fe1bf0b95773dfb/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-common/src/main/java/org/apache/hadoop/yarn/nodelabels/store/AbstractFSNodeStore.java#L69]. This however doesn't implement any kind of retry logic (in contrast to the RM FS state store or ZK state store), hence if the distributed file system is unavailable at the exact moment CS tries to start it'll fail. A retry logic could be implemented to improve the robustness of the startup process. -- This message was sent by Atlassian Jira (v8.20.10#820010) --------------------------------------------------------------------- To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org