[ https://issues.apache.org/jira/browse/YARN-10603?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17276087#comment-17276087 ]
kyungwan nam edited comment on YARN-10603 at 2/1/21, 6:33 AM: -------------------------------------------------------------- I've attached a patch. this patch works well in our cluster. Please review and comment. Thanks. was (Author: kyungwan nam): I've attached a patch. Please review and comment. Thanks > Failed to reinitialize for recovered container > ---------------------------------------------- > > Key: YARN-10603 > URL: https://issues.apache.org/jira/browse/YARN-10603 > Project: Hadoop YARN > Issue Type: Bug > Reporter: kyungwan nam > Assignee: kyungwan nam > Priority: Major > Attachments: YARN-10603.001.patch > > > Container reinitializing request does not work after restarting NM. > I found some problem as below. > - when a recovered container is terminated, exiting occurs because it makes > always either CONTAINER_EXITED_WITH_FAILURE or CONTAINER_EXITED_WITH_SUCCESS > - container’s *recoveredStatus* is set at the time of NM recovery. and it is > never changed even though the container is terminated. > as a result, newly reinitializing container will be launched as a recovered > container, but it doesn't work -- This message was sent by Atlassian Jira (v8.3.4#803005) --------------------------------------------------------------------- To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org