Jason Lowe created YARN-2176:

             Summary: CapacityScheduler loops over all running applications 
rather than actively requesting apps
                 Key: YARN-2176
                 URL: https://issues.apache.org/jira/browse/YARN-2176
             Project: Hadoop YARN
          Issue Type: Improvement
          Components: capacityscheduler
    Affects Versions: 2.4.0
            Reporter: Jason Lowe

The capacity scheduler performance is primarily dominated by 
LeafQueue.assignContainers, and that currently loops over all applications that 
are running in the queue.  It would be more efficient if we looped over just 
the applications that are actively asking for resources rather than all 
applications, as there could be thousands of applications running but only a 
few hundred that are currently asking for resources.

This message was sent by Atlassian JIRA

Reply via email to