[jira] [Commented] (YARN-2331) Distinguish shutdown during supervision vs. shutdown for rolling upgrade

Karthik Palaniappan (JIRA) Tue, 14 Nov 2017 17:11:58 -0800

    [ 
https://issues.apache.org/jira/browse/YARN-2331?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16252784#comment-16252784
 ]


Karthik Palaniappan commented on YARN-2331:
-------------------------------------------

Toggling this new configuration property (yarn.nodemanager.recovery.supervised) 
isn't very different than just toggling the property that enables recovery 
(yarn.nodemanager.recovery.enabled). It's surprising that you now need to flip 
two properties to get NM work preservation to work.

Is there a reason that you need to distinguish between a supervised NM shutdown 
and a rolling upgrade related shutdown?

I'm complaining because the instructions in the 2.7 line are incorrect in 2.8: 
https://hadoop.apache.org/docs/r2.7.4/hadoop-yarn/hadoop-yarn-site/NodeManagerRestart.html.
 Equivalent docs don't exist in the 2.8 line (i.e. if you change the url to be 
r2.8.2), so I couldn't find any documentation of this new property.

> Distinguish shutdown during supervision vs. shutdown for rolling upgrade
> ------------------------------------------------------------------------
>
>                 Key: YARN-2331
>                 URL: https://issues.apache.org/jira/browse/YARN-2331
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>          Components: nodemanager
>    Affects Versions: 2.6.0
>            Reporter: Jason Lowe
>            Assignee: Jason Lowe
>              Labels: BB2015-05-RFC
>             Fix For: 2.8.0, 3.0.0-alpha1
>
>         Attachments: YARN-2331.patch, YARN-2331v2.patch, YARN-2331v3.patch
>
>
> When the NM is shutting down with restart support enabled there are scenarios 
> we'd like to distinguish and behave accordingly:
> # The NM is running under supervision.  In that case containers should be 
> preserved so the automatic restart can recover them.
> # The NM is not running under supervision and a rolling upgrade is not being 
> performed.  In that case the shutdown should kill all containers since it is 
> unlikely the NM will be restarted in a timely manner to recover them.
> # The NM is not running under supervision and a rolling upgrade is being 
> performed.  In that case the shutdown should not kill all containers since a 
> restart is imminent due to the rolling upgrade and the containers will be 
> recovered.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[jira] [Commented] (YARN-2331) Distinguish shutdown during supervision vs. shutdown for rolling upgrade

Reply via email to