[
https://issues.apache.org/jira/browse/YARN-8672?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16677365#comment-16677365
]
Eric Yang commented on YARN-8672:
---------------------------------
[~csingh] I get this error when localization is happening for a yarn service:
{code}
java.io.FileNotFoundException: File
file:/tmp/hadoop-yarn/nm-local-dir/usercache/hbase/appcache/application_1541542727828_0001/container_1541542727828_0001_01_000001.tokens
does not exist
at
org.apache.hadoop.fs.RawLocalFileSystem.deprecatedGetFileStatus(RawLocalFileSystem.java:666)
at
org.apache.hadoop.fs.RawLocalFileSystem.getFileLinkStatusInternal(RawLocalFileSystem.java:987)
at
org.apache.hadoop.fs.RawLocalFileSystem.getFileStatus(RawLocalFileSystem.java:656)
at
org.apache.hadoop.fs.RawLocalFileSystem.open(RawLocalFileSystem.java:212)
at
org.apache.hadoop.fs.DelegateToFileSystem.open(DelegateToFileSystem.java:190)
at
org.apache.hadoop.fs.AbstractFileSystem.open(AbstractFileSystem.java:651)
at org.apache.hadoop.fs.FilterFs.open(FilterFs.java:220)
at org.apache.hadoop.fs.FileContext$6.next(FileContext.java:869)
at org.apache.hadoop.fs.FileContext$6.next(FileContext.java:865)
at org.apache.hadoop.fs.FSLinkResolver.resolve(FSLinkResolver.java:90)
at org.apache.hadoop.fs.FileContext.open(FileContext.java:871)
at
org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ContainerLocalizer.runLocalization(ContainerLocalizer.java:160)
at
org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ContainerLocalizer.main(ContainerLocalizer.java:466)
{code}
In nmPrivate, I found the token filename to be:
container_1541542727828_0001_01_000001166eb1c3e5c.tokens
There is some breakage by this patch that c version of container executor is
unable to identify the correct token file to be copied to container working
directory.
> TestContainerManager#testLocalingResourceWhileContainerRunning occasionally
> times out
> -------------------------------------------------------------------------------------
>
> Key: YARN-8672
> URL: https://issues.apache.org/jira/browse/YARN-8672
> Project: Hadoop YARN
> Issue Type: Bug
> Components: nodemanager
> Affects Versions: 3.2.0
> Reporter: Jason Lowe
> Assignee: Chandni Singh
> Priority: Major
> Attachments: YARN-8672.001.patch, YARN-8672.002.patch,
> YARN-8672.003.patch, YARN-8672.004.patch, YARN-8672.005.patch
>
>
> Precommit builds have been failing in
> TestContainerManager#testLocalingResourceWhileContainerRunning. I have been
> able to reproduce the problem without any patch applied if I run the test
> enough times. It looks like something is removing container tokens from the
> nmPrivate area just as a new localizer starts.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]