[
https://issues.apache.org/jira/browse/YARN-7839?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Panagiotis Garefalakis updated YARN-7839:
-----------------------------------------
Description:
Currently, the Algorithm assigns a node to a request purely based on if the
constraints are met. It is later in the scheduling phase that the Queue
capacity and Node capacity are checked. If the request cannot be placed because
of unavailable Queue/Node capacity, the request is retried by the Algorithm.
For clusters that are running at high utilization, we can reduce the retries if
we perform the Node capacity check in the Algorithm as well. The Queue capacity
check and the other user limit checks can still be handled by the scheduler
(since queues and other limits are tied to the scheduler, and not scheduler
agnostic)
was:
Currently, the Algorithm assigns a node to a requests purely based on if the
constraints are met. It is later in the scheduling phase that the Queue
capacity and Node capacity are checked. If the request cannot be placed because
of unavailable Queue/Node capacity, the request is retried by the Algorithm.
For clusters that are running at high utilization, we can reduce the retries if
we perform the Node capacity check in the Algorithm as well. The Queue capacity
check and the other user limit checks can still be handled by the scheduler
(since queues and other limits are tied to the scheduler, and not scheduler
agnostic)
> Check node capacity before placing in the Algorithm
> ---------------------------------------------------
>
> Key: YARN-7839
> URL: https://issues.apache.org/jira/browse/YARN-7839
> Project: Hadoop YARN
> Issue Type: Sub-task
> Reporter: Arun Suresh
> Priority: Major
>
> Currently, the Algorithm assigns a node to a request purely based on if the
> constraints are met. It is later in the scheduling phase that the Queue
> capacity and Node capacity are checked. If the request cannot be placed
> because of unavailable Queue/Node capacity, the request is retried by the
> Algorithm.
> For clusters that are running at high utilization, we can reduce the retries
> if we perform the Node capacity check in the Algorithm as well. The Queue
> capacity check and the other user limit checks can still be handled by the
> scheduler (since queues and other limits are tied to the scheduler, and not
> scheduler agnostic)
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]