Sandy Ryza created SPARK-3560:
---------------------------------

             Summary: In yarn-cluster mode, jars are distributed through multiple mechanisms.
                 Key: SPARK-3560
                 URL: https://issues.apache.org/jira/browse/SPARK-3560
             Project: Spark
          Issue Type: Bug
            Reporter: Sandy Ryza
            Priority: Critical


In yarn-cluster mode, jars given to spark-submit's --jars argument should be 
distributed to executors through the distributed cache, not through fetching.

Currently, Spark tries to distribute the jars both ways, which can cause 
executors to fail when they attempt to overwrite symlinks without write 
permission.

It looks like this was introduced by SPARK-2260, which sets spark.jars in 
yarn-cluster mode.  Setting spark.jars is necessary for standalone cluster 
deploy mode, but harmful for yarn cluster deploy mode.
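
To illustrate the intended behavior, here is a minimal sketch in Python. It is 
not Spark's actual spark-submit code. The helper name build_submit_conf, the 
"yarn"/"standalone" strings, and the "distributed_cache_jars" key are all 
hypothetical; only spark.jars is a real Spark property. Treating every 
non-yarn-cluster case like standalone is also an assumption made for brevity.

```python
def build_submit_conf(cluster_manager, deploy_mode, jars):
    """Hypothetical helper: decide how --jars should reach executors."""
    conf = {}
    yarn_cluster = cluster_manager == "yarn" and deploy_mode == "cluster"
    if yarn_cluster:
        # Rely on the YARN distributed cache only. Leaving spark.jars unset
        # keeps executors from also fetching the same jars a second time.
        # "distributed_cache_jars" is a made-up key for this sketch.
        conf["distributed_cache_jars"] = list(jars)
    else:
        # For example standalone cluster deploy mode, where the issue says
        # spark.jars is necessary. spark.jars is a comma-separated list.
        conf["spark.jars"] = ",".join(jars)
    return conf
```

With this shape, each jar is assigned to exactly one distribution mechanism, 
so the two paths can never both try to write the same file on an executor.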


