[
https://issues.apache.org/jira/browse/YARN-10902?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
jackwangcs updated YARN-10902:
------------------------------
Description:
If a node has reserved container of an application and the application adds
this node into its blacklist, resources on the node can not allocate other
applications in the current allocation process.
In RegularContainerAllocator,
{code:java}
ContainerAllocation tryAllocateOnNode(Resource clusterResource,
FiCaSchedulerNode node, SchedulingMode schedulingMode,
ResourceLimits resourceLimits, SchedulerRequestKey schedulerKey,
RMContainer reservedContainer) {
ContainerAllocation result;
// Sanity checks before assigning to this node
result = checkIfNodeBlackListed(node, schedulerKey);
if (null != result) {
return result;
}
// ....
}{code}
In this case, the reserved container should be cancelled.
was:If a node has reserved container of an application and the application
adds this node into its blacklist, resources on the node can not allocate other
applications in the current allocation process.
> Resources on application blacklisted node with reserved container can not
> allocate to other applications
> --------------------------------------------------------------------------------------------------------
>
> Key: YARN-10902
> URL: https://issues.apache.org/jira/browse/YARN-10902
> Project: Hadoop YARN
> Issue Type: Bug
> Components: capacityscheduler
> Reporter: jackwangcs
> Priority: Major
>
> If a node has reserved container of an application and the application adds
> this node into its blacklist, resources on the node can not allocate other
> applications in the current allocation process.
> In RegularContainerAllocator,
> {code:java}
> ContainerAllocation tryAllocateOnNode(Resource clusterResource,
> FiCaSchedulerNode node, SchedulingMode schedulingMode,
> ResourceLimits resourceLimits, SchedulerRequestKey schedulerKey,
> RMContainer reservedContainer) {
> ContainerAllocation result;
> // Sanity checks before assigning to this node
> result = checkIfNodeBlackListed(node, schedulerKey);
> if (null != result) {
> return result;
> }
> // ....
> }{code}
> In this case, the reserved container should be cancelled.
>
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]