Tamas Domok created YARN-11022:
----------------------------------

             Summary: Fix the documentation for max-parallel-apps in CS
                 Key: YARN-11022
                 URL: https://issues.apache.org/jira/browse/YARN-11022
             Project: Hadoop YARN
          Issue Type: Bug
          Components: capacity scheduler
    Affects Versions: 3.4.0
            Reporter: Tamas Domok
            Assignee: Tamas Domok


The documentation does not mention that the max-parallel-apps property is 
inherited. The property can be overridden on a per queue basis, but the 
parent(s) can also restrict how many parallel apps can be run.

 

{*}yarn.scheduler.capacity.max-parallel-apps / 
yarn.scheduler.capacity.<queue-path>.max-parallel-apps{*}: Maximum number of 
applications that can run at the same time. Unlike to maximum-applications, 
application submissions are not rejected when this limit is reached. Instead 
they stay in ACCEPTED state until they are eligible to run. This can be set for 
all queues with yarn.scheduler.capacity.max-parallel-apps and can also be 
overridden on a per queue basis by setting 
yarn.scheduler.capacity.<queue-path>.max-parallel-apps. Integer value is 
expected. By default, there is no limit.

 

[https://github.com/apache/hadoop/blob/03cfc852791c14fad39db4e5b14104a276c08e59/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/capacity/CSMaxRunningAppsEnforcer.java#L99]
  private boolean exceedQueueMaxParallelApps(AbstractCSQueue queue) {
    // Check queue and all parent queues    while (queue != null) {
      if (queue.getNumRunnableApps() >= queue.getMaxParallelApps()) {
        LOG.info("Maximum runnable apps exceeded for queue {}",
            queue.getQueuePath());
        return true;
      }
      queue = (AbstractCSQueue) queue.getParent();
    }    return false;
  } 

Example:



Let's say the user configured the *yarn.scheduler.capacity.max-parallel-apps* 
to 250, that will be the default for queues that doesn't override the setting. 
([https://github.com/apache/hadoop/blob/32ecaed9c3c06a48ef01d0437e62e8faccd3e9f3/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/capacity/CapacitySchedulerConfiguration.java#L1688])

Given this queue hierarchy:
||root||.a||.a1||.a2||.a3||.a4||
|500|default|50|10|default|15|
||root||.a||.b||
|500|default|50|
 - maximum 250 apps can run parallel under the *root.a* queues.
 - maximum 50 apps can run parallel under the *root.a.a1* queues.
 - maximum 10 apps can run parallel under the *root.a.a1.a2* queues.
 - maximum *10* apps can run parallel under the *root.a1.a2.a3* queues. (even 
though the max-parallel-apps is not set for .a3 so the default 250 applies for 
that queue, but it's parent had a lower value, and children can't exceed that)
 - maximum *10* apps can run parallel under the *root.a1.a2.a3.a4* queue. (even 
though it's configured for 15, the parents restrict this limit to 10)
 - maximum 50 apps can run parallel under the *root.a.b* queue.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to