[
https://issues.apache.org/jira/browse/SPARK-6047?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14339453#comment-14339453
]
Apache Spark commented on SPARK-6047:
-------------------------------------
User 'brkyvz' has created a pull request for this issue:
https://github.com/apache/spark/pull/4754
> pyspark - class loading on driver failing with --jars and --packages
> --------------------------------------------------------------------
>
> Key: SPARK-6047
> URL: https://issues.apache.org/jira/browse/SPARK-6047
> Project: Spark
> Issue Type: Bug
> Components: PySpark, Spark Submit
> Affects Versions: 1.3.0
> Reporter: Burak Yavuz
>
> Because py4j uses the system ClassLoader instead of the contextClassLoader of
> the thread, the dynamically added jars in Spark Submit can't be loaded in the
> driver.
> This causes `Py4JError: Trying to call a package` errors.
> Usually `--packages` are downloaded from some remote repo before runtime,
> adding them explicitly to `--driver-class-path` is not an option, like we can
> do with `--jars`. One solution is to move the fetching of `--packages` to the
> SparkSubmitDriverBootstrapper, and add it to the driver class-path there.
> A more complete solution can be achieved through [SPARK-4924].
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]