Hoon Park created ZEPPELIN-1883:
-----------------------------------
Summary: Can't import packages requested by SPARK_SUBMIT_OPTION in pyspark
Key: ZEPPELIN-1883
URL: https://issues.apache.org/jira/browse/ZEPPELIN-1883
Project: Zeppelin
Issue Type: Bug
Components: pySpark
Reporter: Hoon Park
Fix For: 0.7.0
The Zeppelin pyspark interpreter can't import packages submitted via {{SPARK_SUBMIT_OPTIONS}}.
For example,
{code}
# conf/zeppelin-env.sh
...
export SPARK_HOME="~/github/apache-spark/1.6.2-bin-hadoop2.6"
export SPARK_SUBMIT_OPTIONS="--packages com.datastax.spark:spark-cassandra-connector_2.10:1.6.2,TargetHolding:pyspark-cassandra:0.3.5 --exclude-packages org.slf4j:slf4j-api"
...
{code}
Then try to import the {{pyspark_cassandra}} module in the Zeppelin pyspark interpreter:
{code}
import pyspark_cassandra
Traceback (most recent call last):
  File "/var/folders/lr/8g9y625n5j39rz6qhkg8s6640000gn/T/zeppelin_pyspark-5266742863961917074.py", line 267, in <module>
    raise Exception(traceback.format_exc())
Exception: Traceback (most recent call last):
  File "/var/folders/lr/8g9y625n5j39rz6qhkg8s6640000gn/T/zeppelin_pyspark-5266742863961917074.py", line 265, in <module>
    exec(code)
  File "<stdin>", line 1, in <module>
ImportError: No module named pyspark_cassandra
{code}
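A possible way to narrow this down is to check whether the downloaded package jar is on the interpreter's {{sys.path}} at all. The sketch below is only an illustration, not a fix from this issue: it assumes the jars fetched by {{--packages}} are cached under {{~/.ivy2/jars}} (Spark's default Ivy location) and that the pyspark-cassandra jar bundles its Python sources, so Python can import from it like any zip archive. The helper names {{find_package_jars}} and {{add_jars_to_sys_path}} are hypothetical.

```python
import os
import sys


def find_package_jars(name_fragment, ivy_jars_dir=None):
    """Return sorted paths of cached jars whose file name contains name_fragment."""
    if ivy_jars_dir is None:
        # Assumption: Spark's default Ivy cache used for --packages downloads.
        ivy_jars_dir = os.path.join(os.path.expanduser("~"), ".ivy2", "jars")
    if not os.path.isdir(ivy_jars_dir):
        return []
    return sorted(
        os.path.join(ivy_jars_dir, file_name)
        for file_name in os.listdir(ivy_jars_dir)
        if name_fragment in file_name and file_name.endswith(".jar")
    )


def add_jars_to_sys_path(jars):
    """Append each jar to sys.path once and return the jars newly added."""
    added = []
    for jar in jars:
        if jar not in sys.path:
            # A jar is a zip archive, so Python can import modules packed in it.
            sys.path.append(jar)
            added.append(jar)
    return added
```

In a pyspark paragraph, calling {{add_jars_to_sys_path(find_package_jars("pyspark-cassandra"))}} before {{import pyspark_cassandra}} would show whether the import fails only because the jar is missing from the Python path. If the first call returns an empty list, the package was likely never downloaded or is cached elsewhere.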