Meng Zhu created MESOS-9806:
-------------------------------

             Summary: Address allocator performance regression due to the 
removal of quota role sorter.
                 Key: MESOS-9806
                 URL: https://issues.apache.org/jira/browse/MESOS-9806
             Project: Mesos
          Issue Type: Improvement
          Components: allocation
            Reporter: Meng Zhu
            Assignee: Meng Zhu


In MESOS-9802, we removed the quota role sorter, which was tech debt.

However, this slows down the allocator. In the first allocation stage, the 
allocator now has to sort and iterate over every role in the cluster, even 
when the cluster has no active roles with non-default quota. Benchmark results 
show that with 1k roles and 2k frameworks, the allocator can experience a ~50% 
performance degradation.

There are a couple of ways to address this issue. For example, we could make 
the sorter aware of quota and add a method, say `sortQuotaRoles`, that returns 
all the roles with non-default quota. Alternatively, an even better approach 
would be to deprecate the sorter concept and just have two standalone functions, 
e.g. sortRoles() and sortQuotaRoles(), that take in the role tree structure 
(which does not yet exist in the allocator) and return the sorted roles.
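
To make the second option concrete, here is a minimal sketch of what the two 
standalone functions might look like. Everything except the names sortRoles() 
and sortQuotaRoles() is an assumption made for illustration: the `Role` and 
`RoleTree` types are hypothetical and flattened to a list, and the `share` 
sort key merely stands in for whatever ordering the allocator actually uses. 
None of this is existing Mesos code.

```cpp
#include <algorithm>
#include <string>
#include <vector>

// Hypothetical, simplified role record (not a Mesos type).
struct Role {
  std::string name;
  double share;    // Assumed sort key; a lower share sorts first.
  bool hasQuota;   // True if the role has a non-default quota.
  bool active;     // True if the role is active.
};

// Hypothetical stand-in for the role tree, flattened to a list.
struct RoleTree {
  std::vector<Role> roles;
};

// Sorts the selected roles by (share, name) and returns their names.
static std::vector<std::string> sortedNames(std::vector<const Role*> selected) {
  std::sort(selected.begin(), selected.end(),
            [](const Role* a, const Role* b) {
              if (a->share != b->share) return a->share < b->share;
              return a->name < b->name;
            });

  std::vector<std::string> names;
  names.reserve(selected.size());
  for (const Role* role : selected) {
    names.push_back(role->name);
  }
  return names;
}

// Returns all active roles in sorted order.
std::vector<std::string> sortRoles(const RoleTree& tree) {
  std::vector<const Role*> selected;
  for (const Role& role : tree.roles) {
    if (role.active) selected.push_back(&role);
  }
  return sortedNames(selected);
}

// Returns only the active roles with non-default quota, in sorted order.
// Filtering happens before sorting, so a cluster without quota roles
// pays for a linear scan but not for a sort of every role.
std::vector<std::string> sortQuotaRoles(const RoleTree& tree) {
  std::vector<const Role*> selected;
  for (const Role& role : tree.roles) {
    if (role.active && role.hasQuota) selected.push_back(&role);
  }
  return sortedNames(selected);
}
```

In this sketch the first stage would call sortQuotaRoles() and get an empty 
list back when no role has non-default quota. A real implementation could 
avoid even the linear scan by having the role tree track its quota roles 
separately.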




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
