[ 
https://issues.apache.org/jira/browse/YARN-9106?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16920328#comment-16920328
 ] 

Abhishek Modi commented on YARN-9106:
-------------------------------------

Gracefully decommissioning nodes without waiting for applications to finish 
also makes sense when shuffle data is offloaded to persistent storage or 
shuffle service is running completely outside nodes.

While running Yarn on cloud, it is very common to offload shuffle data to 
persistent volumes and remove nodes. cc [~elgoiri]

> Add option to graceful decommission to not wait for applications
> ----------------------------------------------------------------
>
>                 Key: YARN-9106
>                 URL: https://issues.apache.org/jira/browse/YARN-9106
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>          Components: resourcemanager
>            Reporter: Mikayla Konst
>            Assignee: Mikayla Konst
>            Priority: Major
>         Attachments: YARN-9106.patch
>
>
> Add property 
> yarn.resourcemanager.decommissioning-nodes-watcher.wait-for-applications.
> If true (the default), the resource manager waits for all containers, as well 
> as all applications associated with those containers, to finish before 
> gracefully decommissioning a node.
> If false, the resource manager only waits for containers, but not 
> applications, to finish. For map-only jobs or other jobs in which mappers do 
> not need to serve shuffle data, this allows nodes to be decommissioned as 
> soon as their containers are finished as opposed to when the job is done.
> Add property 
> yarn.resourcemanager.decommissioning-nodes-watcher.wait-for-app-masters.
> If false, during graceful decommission, when the resource manager waits for 
> all containers on a node to finish, it will not wait for app master 
> containers to finish. Defaults to true. This property should only be set to 
> false if app master failure is recoverable.



--
This message was sent by Atlassian Jira
(v8.3.2#803003)

---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org

Reply via email to