[ 
https://issues.apache.org/jira/browse/YARN-8672?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16677365#comment-16677365
 ] 

Eric Yang commented on YARN-8672:
---------------------------------

[~csingh] I get this error when localization is happening for a yarn service:

{code}
java.io.FileNotFoundException: File 
file:/tmp/hadoop-yarn/nm-local-dir/usercache/hbase/appcache/application_1541542727828_0001/container_1541542727828_0001_01_000001.tokens
 does not exist
        at 
org.apache.hadoop.fs.RawLocalFileSystem.deprecatedGetFileStatus(RawLocalFileSystem.java:666)
        at 
org.apache.hadoop.fs.RawLocalFileSystem.getFileLinkStatusInternal(RawLocalFileSystem.java:987)
        at 
org.apache.hadoop.fs.RawLocalFileSystem.getFileStatus(RawLocalFileSystem.java:656)
        at 
org.apache.hadoop.fs.RawLocalFileSystem.open(RawLocalFileSystem.java:212)
        at 
org.apache.hadoop.fs.DelegateToFileSystem.open(DelegateToFileSystem.java:190)
        at 
org.apache.hadoop.fs.AbstractFileSystem.open(AbstractFileSystem.java:651)
        at org.apache.hadoop.fs.FilterFs.open(FilterFs.java:220)
        at org.apache.hadoop.fs.FileContext$6.next(FileContext.java:869)
        at org.apache.hadoop.fs.FileContext$6.next(FileContext.java:865)
        at org.apache.hadoop.fs.FSLinkResolver.resolve(FSLinkResolver.java:90)
        at org.apache.hadoop.fs.FileContext.open(FileContext.java:871)
        at 
org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ContainerLocalizer.runLocalization(ContainerLocalizer.java:160)
        at 
org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ContainerLocalizer.main(ContainerLocalizer.java:466)
{code}

In nmPrivate, I found the token filename to be: 
container_1541542727828_0001_01_000001166eb1c3e5c.tokens
There is some breakage by this patch that c version of container executor is 
unable to identify the correct token file to be copied to container working 
directory.

> TestContainerManager#testLocalingResourceWhileContainerRunning occasionally 
> times out
> -------------------------------------------------------------------------------------
>
>                 Key: YARN-8672
>                 URL: https://issues.apache.org/jira/browse/YARN-8672
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: nodemanager
>    Affects Versions: 3.2.0
>            Reporter: Jason Lowe
>            Assignee: Chandni Singh
>            Priority: Major
>         Attachments: YARN-8672.001.patch, YARN-8672.002.patch, 
> YARN-8672.003.patch, YARN-8672.004.patch, YARN-8672.005.patch
>
>
> Precommit builds have been failing in 
> TestContainerManager#testLocalingResourceWhileContainerRunning.  I have been 
> able to reproduce the problem without any patch applied if I run the test 
> enough times.  It looks like something is removing container tokens from the 
> nmPrivate area just as a new localizer starts.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to