Chang Li created YARN-4105:
------------------------------
Summary: Capacity Scheduler headroom for DRF is wrong
Key: YARN-4105
URL: https://issues.apache.org/jira/browse/YARN-4105
Project: Hadoop YARN
Issue Type: Bug
Reporter: Chang Li
Assignee: Chang Li
This is related to the problem discussed in YARN-1857, but the min method is
flawed when we are using the DominantResourceCalculator (DRC). We have run into
a real scenario in production where
queueCapacity: <memory:1056256, vCores:3750>,
qconsumed: <memory:1054720, vCores:361>,
consumed: <memory:125952, vCores:170>,
limit: <memory:214016, vCores:755>.
The headroom calculation returns 88064 for memory (limit minus consumed), where
there is only 1536 left in the queue (queueCapacity minus qconsumed), because
DRC effectively compares by vcores. This then caused a deadlock, because the
RMContainerAllocator thought there was still space for a mapper and would not
preempt a reducer in a full queue to schedule a mapper. I propose a fix using
componentwiseMin.
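
The arithmetic in the scenario above can be reproduced with a small
standalone sketch. It does not use the Hadoop classes: Res, dominantMin and
componentwiseMin below are simplified stand-ins written for illustration, and
the cluster total of <memory:4000000, vCores:4000> is a made-up assumption
(the real cluster size is not given in this report), chosen only so that
vcores is the dominant resource for both operands.

```java
public class HeadroomSketch {
    /** Simplified stand-in for a YARN resource vector (not the Hadoop class). */
    public static final class Res {
        public final long memory;
        public final long vCores;

        public Res(long memory, long vCores) {
            this.memory = memory;
            this.vCores = vCores;
        }

        public Res minus(Res o) {
            return new Res(memory - o.memory, vCores - o.vCores);
        }

        @Override
        public String toString() {
            return "<memory:" + memory + ", vCores:" + vCores + ">";
        }
    }

    /** Largest fraction of the cluster that r uses in any single dimension. */
    public static double dominantShare(Res r, Res cluster) {
        return Math.max((double) r.memory / cluster.memory,
                        (double) r.vCores / cluster.vCores);
    }

    /** Picks one whole operand by dominant share (simplified DRC-style min). */
    public static Res dominantMin(Res a, Res b, Res cluster) {
        return dominantShare(a, cluster) <= dominantShare(b, cluster) ? a : b;
    }

    /** Takes the smaller value in each dimension independently. */
    public static Res componentwiseMin(Res a, Res b) {
        return new Res(Math.min(a.memory, b.memory), Math.min(a.vCores, b.vCores));
    }

    public static void main(String[] args) {
        // Values taken from the report.
        Res queueCapacity = new Res(1056256, 3750);
        Res qconsumed = new Res(1054720, 361);
        Res consumed = new Res(125952, 170);
        Res limit = new Res(214016, 755);
        // ASSUMPTION: hypothetical cluster total, not from the report.
        Res cluster = new Res(4000000, 4000);

        Res userLeft = limit.minus(consumed);            // <memory:88064, vCores:585>
        Res queueLeft = queueCapacity.minus(qconsumed);  // <memory:1536, vCores:3389>

        // Dominant-share min returns userLeft whole, so memory headroom is 88064.
        System.out.println(dominantMin(userLeft, queueLeft, cluster));
        // Component-wise min caps memory at what the queue really has left: 1536.
        System.out.println(componentwiseMin(userLeft, queueLeft));
    }
}
```

With these inputs the dominant-share min reports <memory:88064, vCores:585>,
while the component-wise min reports <memory:1536, vCores:585>, which matches
the memory actually left in the queue.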