[
https://issues.apache.org/jira/browse/YARN-4770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15675321#comment-15675321
]
Jian He commented on YARN-4770:
-------------------------------
[~hex108],
bq. If container crashed during the NM reboot, container would transit to
RELAUNCHING state. I will check it again.
Is this working now ? if so, we can close this.
> Auto-restart of containers should work across NM restarts.
> ----------------------------------------------------------
>
> Key: YARN-4770
> URL: https://issues.apache.org/jira/browse/YARN-4770
> Project: Hadoop YARN
> Issue Type: Sub-task
> Reporter: Vinod Kumar Vavilapalli
> Assignee: Vinod Kumar Vavilapalli
>
> See my comment
> [here|https://issues.apache.org/jira/browse/YARN-3998?focusedCommentId=15133367&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-15133367]
> on YARN-3998. Need to take care of two things:
> - The relaunch feature needs to work across NM restarts, so we should save
> the retry-context and policy per container into the state-store and reload it
> for continue relaunching after NM restart.
> - We should also handle restarting of any containers that may have crashed
> during the NM reboot.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]