[
https://issues.apache.org/jira/browse/YARN-9126?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16723182#comment-16723182
]
Chandni Singh commented on YARN-9126:
-------------------------------------
[~eyang] I think this is because of YARN-7644.
Before this change, the cleanup of the container working directory was done in
a blocking way. This change made it non-blocking which is causing the issue.
> Container reinit always fails in branch-3.2 and trunk
> -----------------------------------------------------
>
> Key: YARN-9126
> URL: https://issues.apache.org/jira/browse/YARN-9126
> Project: Hadoop YARN
> Issue Type: Sub-task
> Reporter: Eric Yang
> Assignee: Chandni Singh
> Priority: Major
> Labels: docker
>
> When upgrading container, container reinitialization always failed with code
> 33. This error code means the localizing file already exist while copying
> resource files. The container will retry with another container ID, hence
> the problem is masked.
> Hadoop 3.1.x relaunch logic seem to have some way to prevent this bug from
> happening. The same logic might be useful in branch 3.2 and trunk.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]