Yang Wang created YARN-6630:
-------------------------------

             Summary: Container worker dir could not recover when NM restart
                 Key: YARN-6630
                 URL: https://issues.apache.org/jira/browse/YARN-6630
             Project: Hadoop YARN
          Issue Type: Bug
            Reporter: Yang Wang

When the ContainerRetryPolicy is NEVER_RETRY, the container's work dir is not saved in the NM state store. After the NM restarts, container.workDir is therefore null, which may cause other exceptions.

{code:title=ContainerLaunch.java}
...
  private void recordContainerWorkDir(ContainerId containerId,
      String workDir) throws IOException {
    container.setWorkDir(workDir);
    if (container.isRetryContextSet()) {
      context.getNMStateStore().storeContainerWorkDir(containerId, workDir);
    }
  }
{code}

{code:title=ContainerImpl.java}
  static class ResourceLocalizedWhileRunningTransition
      extends ContainerTransition {
...
      String linkFile = new Path(container.workDir, link).toString();
...
{code}
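
To illustrate the problem outside of Hadoop, here is a minimal, self-contained sketch. It is not the actual NodeManager code: SketchStateStore is a hypothetical in-memory stand-in for the NM state store, and the method names recordIfRetryContextSet and recordAlways are made up for this example. The first method mirrors the conditional store shown in ContainerLaunch.java; the second shows one possible direction for a fix (always persisting the work dir), which is an assumption and not necessarily the change that will be committed.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical in-memory stand-in for the NM state store (not the real Hadoop class).
class SketchStateStore {
    private final Map<String, String> workDirs = new HashMap<>();

    void storeContainerWorkDir(String containerId, String workDir) {
        workDirs.put(containerId, workDir);
    }

    // Returns null when nothing was stored, which is what recovery would see after a restart.
    String recoverContainerWorkDir(String containerId) {
        return workDirs.get(containerId);
    }
}

class WorkDirRecordingSketch {
    // Mirrors the reported behavior: persist only when a retry context is set.
    static void recordIfRetryContextSet(SketchStateStore store, String containerId,
                                        String workDir, boolean retryContextSet) {
        if (retryContextSet) {
            store.storeContainerWorkDir(containerId, workDir);
        }
    }

    // Assumed alternative: persist unconditionally so recovery never sees null.
    static void recordAlways(SketchStateStore store, String containerId, String workDir) {
        store.storeContainerWorkDir(containerId, workDir);
    }
}
```

With retryContextSet set to false (the NEVER_RETRY case), the first method leaves nothing to recover, so any later code that builds a path from the work dir would receive null. The unconditional variant avoids that at the cost of one extra state store write per container.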