[ https://issues.apache.org/jira/browse/YARN-2176?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14038045#comment-14038045 ]
Sandy Ryza commented on YARN-2176:
----------------------------------

Can we merge the ActiveUsersManager stuff into an abstract SchedulerLeafQueue class that FSLeafQueue and LeafQueue extend from? AppSchedulingInfo is private / unstable, so we can modify its constructor to take a SchedulerLeafQueue instead of a Queue.

> CapacityScheduler loops over all running applications rather than actively requesting apps
> ------------------------------------------------------------------------------------------
>
>                 Key: YARN-2176
>                 URL: https://issues.apache.org/jira/browse/YARN-2176
>             Project: Hadoop YARN
>          Issue Type: Improvement
>          Components: capacityscheduler
>    Affects Versions: 2.4.0
>            Reporter: Jason Lowe
>
> The capacity scheduler performance is primarily dominated by
> LeafQueue.assignContainers, and that currently loops over all applications
> that are running in the queue. It would be more efficient if we looped over
> just the applications that are actively asking for resources rather than all
> applications, as there could be thousands of applications running but only a
> few hundred that are currently asking for resources.

--
This message was sent by Atlassian JIRA
(v6.2#6252)
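
For illustration only, here is a minimal sketch of the idea in the issue description: keep a separate set of applications that currently have outstanding requests, and have the assignment loop walk that set instead of every running application. This is not the CapacityScheduler code. The names ActiveAppsSketch, App, LeafQueueSketch, updateRequests, and the one-container-per-pass policy are all hypothetical and chosen only to keep the example self-contained.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

public class ActiveAppsSketch {

    /** Hypothetical stand-in for a running application; not a YARN class. */
    static final class App {
        final String id;
        int pendingRequests;

        App(String id) {
            this.id = id;
        }
    }

    /** Hypothetical queue that tracks which apps are actively asking for resources. */
    static final class LeafQueueSketch {
        private final Map<String, App> runningApps = new LinkedHashMap<>();
        private final Set<App> activeApps = new LinkedHashSet<>();

        void addApp(String id) {
            runningApps.put(id, new App(id));
        }

        /** Records an app's outstanding request count and updates the active set. */
        void updateRequests(String id, int pending) {
            App app = runningApps.get(id);
            if (app == null) {
                throw new IllegalArgumentException("unknown app: " + id);
            }
            app.pendingRequests = pending;
            if (pending > 0) {
                activeApps.add(app);
            } else {
                activeApps.remove(app);
            }
        }

        /**
         * Satisfies one request per active app (an assumed, simplified policy)
         * and returns the ids of the apps that were visited.
         */
        List<String> assignContainers() {
            List<String> visited = new ArrayList<>();
            // Iterate over a snapshot so apps can leave the active set mid-loop.
            for (App app : new ArrayList<>(activeApps)) {
                visited.add(app.id);
                app.pendingRequests--;
                if (app.pendingRequests == 0) {
                    activeApps.remove(app);
                }
            }
            return visited;
        }

        int runningCount() {
            return runningApps.size();
        }

        int activeCount() {
            return activeApps.size();
        }
    }

    public static void main(String[] args) {
        LeafQueueSketch queue = new LeafQueueSketch();
        for (int i = 0; i < 1000; i++) {
            queue.addApp("app-" + i);
        }
        // Only three of the running apps are asking for resources.
        queue.updateRequests("app-7", 1);
        queue.updateRequests("app-42", 2);
        queue.updateRequests("app-900", 1);

        List<String> visited = queue.assignContainers();
        System.out.println("visited " + visited.size() + " of "
                + queue.runningCount() + " running apps");
    }
}
```

The cost of each pass then scales with the number of apps that are asking for resources rather than with the number of running apps, at the price of keeping the active set up to date whenever an app's requests change.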