[
https://issues.apache.org/jira/browse/YARN-8667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16583168#comment-16583168
]
Hudson commented on YARN-8667:
------------------------------
SUCCESS: Integrated in Jenkins build Hadoop-trunk-Commit #14788 (See
[https://builds.apache.org/job/Hadoop-trunk-Commit/14788/])
YARN-8667. Cleanup symlinks when container restarted by NM. (eyang:
rev d42806160eb95594f08f38bb753cf0306a191a38)
* (edit)
hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/test/java/org/apache/hadoop/yarn/server/nodemanager/TestContainerExecutor.java
* (edit)
hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/main/java/org/apache/hadoop/yarn/server/nodemanager/ContainerExecutor.java
* (edit)
hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/main/java/org/apache/hadoop/yarn/server/nodemanager/containermanager/launcher/ContainerLaunch.java
> Cleanup symlinks when container restarted by NM to solve issue "find: File
> system loop detected;" for tar ball artifacts.
> -------------------------------------------------------------------------------------------------------------------------
>
> Key: YARN-8667
> URL: https://issues.apache.org/jira/browse/YARN-8667
> Project: Hadoop YARN
> Issue Type: Bug
> Reporter: Rohith Sharma K S
> Assignee: Chandni Singh
> Priority: Critical
> Attachments: YARN-8667.001.patch, YARN-8667.002.patch
>
>
> A service is launched with tarball artifacts. If a container exits for any
> reason, the container relaunch policy tries to relaunch it on the same node
> with the same container workspace. As a result, the relaunch keeps failing.
> If the container relaunch max-retry policy is disabled, the container is
> never launched on any other node; instead it keeps retrying on the same
> node manager, which never succeeds.
> {code}
> Relaunching Container container_e05_1533635581781_0001_01_000002. Remaining
> retry attempts(after relaunch) : -4816.
> {code}
> There are two issues:
> # Container relaunch keeps failing
> # The log message is misleading
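
The issue title ties the relaunch failure to symlinks left behind in the reused container workspace. The following is a minimal Java sketch of that cleanup idea only, not the code from the YARN-8667 patch: the class name SymlinkCleanupSketch, the method removeSymlinks, and the directory names in main are hypothetical, and it assumes the stale links sit directly under the container working directory.

```java
import java.io.IOException;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;

class SymlinkCleanupSketch {
    // Hypothetical helper (not part of Hadoop): deletes symbolic links found
    // directly under workDir and returns how many were removed.
    static int removeSymlinks(Path workDir) throws IOException {
        int removed = 0;
        try (DirectoryStream<Path> entries = Files.newDirectoryStream(workDir)) {
            for (Path entry : entries) {
                // isSymbolicLink inspects the entry itself and does not follow
                // the link, so a link pointing back into workDir is harmless here.
                if (Files.isSymbolicLink(entry)) {
                    Files.delete(entry); // deletes the link, not its target
                    removed++;
                }
            }
        }
        return removed;
    }

    public static void main(String[] args) throws IOException {
        // Simulate a workspace left over from a previous launch attempt.
        Path workDir = Files.createTempDirectory("container-work");
        Path target = Files.createDirectory(workDir.resolve("unpacked"));
        Files.createSymbolicLink(workDir.resolve("lib"), target);
        System.out.println("removed " + removeSymlinks(workDir) + " symlink(s)");
    }
}
```

Running such a cleanup before links are recreated would let each relaunch start from a workspace without stale links, while the unpacked tarball contents stay in place because only the links themselves are deleted.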