[ 
https://issues.apache.org/jira/browse/FLINK-20461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17402587#comment-17402587
 ] 

Till Rohrmann commented on FLINK-20461:
---------------------------------------

I think the problem could be that we are looking for the Flink dist jar after 
the job has terminated. This also means that we are looking for this file while 
Yarn will clean up the directory of the submitted Yarn application. Hence, I 
think we are looking at a classic race condition. I'll try to verify this 
suspicion.

> YARNFileReplicationITCase.testPerJobModeWithDefaultFileReplication
> ------------------------------------------------------------------
>
>                 Key: FLINK-20461
>                 URL: https://issues.apache.org/jira/browse/FLINK-20461
>             Project: Flink
>          Issue Type: Bug
>          Components: Deployment / YARN
>    Affects Versions: 1.11.3, 1.12.0, 1.13.0, 1.14.0
>            Reporter: Huang Xingbo
>            Assignee: Till Rohrmann
>            Priority: Critical
>              Labels: test-stability
>             Fix For: 1.14.0
>
>
> [https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=10450&view=logs&j=fc5181b0-e452-5c8f-68de-1097947f6483&t=62110053-334f-5295-a0ab-80dd7e2babbf]
> {code:java}
> [ERROR] 
> testPerJobModeWithDefaultFileReplication(org.apache.flink.yarn.YARNFileReplicationITCase)
>  Time elapsed: 32.501 s <<< ERROR! java.io.FileNotFoundException: File does 
> not exist: 
> hdfs://localhost:46072/user/agent04_azpcontainer/.flink/application_1606950278664_0001/flink-dist_2.11-1.12-SNAPSHOT.jar
>  at 
> org.apache.hadoop.hdfs.DistributedFileSystem$27.doCall(DistributedFileSystem.java:1441)
>  at 
> org.apache.hadoop.hdfs.DistributedFileSystem$27.doCall(DistributedFileSystem.java:1434)
>  at 
> org.apache.hadoop.fs.FileSystemLinkResolver.resolve(FileSystemLinkResolver.java:81)
>  at 
> org.apache.hadoop.hdfs.DistributedFileSystem.getFileStatus(DistributedFileSystem.java:1434)
>  at 
> org.apache.flink.yarn.YARNFileReplicationITCase.extraVerification(YARNFileReplicationITCase.java:148)
>  at 
> org.apache.flink.yarn.YARNFileReplicationITCase.deployPerJob(YARNFileReplicationITCase.java:113)
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to