Yang Wang created FLINK-13938:
---------------------------------

             Summary: Use yarn public distributed cache to speed up containers 
launch
                 Key: FLINK-13938
                 URL: https://issues.apache.org/jira/browse/FLINK-13938
             Project: Flink
          Issue Type: New Feature
            Reporter: Yang Wang


By default, the LocalResourceVisibility is APPLICATION, so they will be 
downloaded only once and shared for all taskmanager containers of a same 
application in the same node. However, different applications will have to 
download all jars every time, including the flink-dist.jar. I think we could 
use the yarn public cache to eliminate the unnecessary jars downloading and 
make launching container faster.

 

How to use the shared lib feature?
 # Upload a copy of flink release binary to hdfs.
 # Use the -ysl argument to specify the shared lib

{code:java}
./bin/flink run -d -m yarn-cluster -p 20 -ysl 
hdfs:///flink/release/flink-1.9.0/lib examples/streaming/WindowJoin.jar{code}
 

-ysl,--yarnsharedLib <path>          Upload a copy of flink lib beforehand

                                                          and specify the path 
to use public

                                                          visibility feature of 
YARN NodeManager

                                                          localizing resources.



--
This message was sent by Atlassian Jira
(v8.3.2#803003)

Reply via email to