[ https://issues.apache.org/jira/browse/YARN-11798?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Shilun Fan resolved YARN-11798. ------------------------------- Fix Version/s: 3.5.0 Hadoop Flags: Reviewed Target Version/s: 3.5.0 Resolution: Fixed > Precheck request separately to avoid redundant node checks and optimize > performance for global scheduler. > --------------------------------------------------------------------------------------------------------- > > Key: YARN-11798 > URL: https://issues.apache.org/jira/browse/YARN-11798 > Project: Hadoop YARN > Issue Type: Improvement > Components: capacityscheduler > Reporter: Tao Yang > Assignee: Tao Yang > Priority: Major > Labels: pull-request-available > Fix For: 3.5.0 > > > The current implementation of the > RegularContainerAllocator#preCheckCandidateNodes performs redundant > check-request logic for each node, leading to significant latency during > global scheduling with a large number of candidate nodes. > By splitting this method into two separate methods: preCheckRequest and > preCheckNode, we can ensure that the check-request logic is executed only > once per request before checking all nodes, thereby optimizing scheduling > performance and reducing latency. > > In a scheduling scenario with 2000 candidate nodes and an unsatisfied request > that reaches the headroom, the global scheduler currently checks the request > for each node repeatedly, causing a scheduling cycle to exceed 200ms. With > this improvement, the global scheduler checks the request only once before > evaluating all nodes, reducing the scheduling cycle to less than 2ms. -- This message was sent by Atlassian Jira (v8.20.10#820010) --------------------------------------------------------------------- To unsubscribe, e-mail: yarn-dev-unsubscr...@hadoop.apache.org For additional commands, e-mail: yarn-dev-h...@hadoop.apache.org