[
https://issues.apache.org/jira/browse/YARN-6127?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16059800#comment-16059800
]
Arun Suresh commented on YARN-6127:
-----------------------------------
Will commit this shortly.
One suggestion though - there are a lot of places where we check if the
stateStore != null. This is true in the existing codebase too. Ideally, the
stateStore should never be null and we should have to perform so many defensive
checks. [~botong], can you open a followup JIRA to fix this ?
> Add support for work preserving NM restart when AMRMProxy is enabled
> --------------------------------------------------------------------
>
> Key: YARN-6127
> URL: https://issues.apache.org/jira/browse/YARN-6127
> Project: Hadoop YARN
> Issue Type: Sub-task
> Components: amrmproxy, nodemanager
> Reporter: Subru Krishnan
> Assignee: Botong Huang
> Attachments: YARN-6127.v1.patch, YARN-6127.v2.patch,
> YARN-6127.v3.patch, YARN-6127.v4.patch
>
>
> YARN-1336 added the ability to restart NM without loosing any running
> containers. In a Federated YARN environment, there's additional state in the
> {{AMRMProxy}} to allow for spanning across multiple sub-clusters, so we need
> to enhance {{AMRMProxy}} to support work-preserving restart.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]