[ 
https://issues.apache.org/jira/browse/TOREE-344?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Luciano Resende resolved TOREE-344.
-----------------------------------
       Resolution: Won't Fix
    Fix Version/s: 0.2.0

This is true for any python dependencies and I believe its more like a runtime 
feature request rather than a kernel issue.

> No module named pyspark
> -----------------------
>
>                 Key: TOREE-344
>                 URL: https://issues.apache.org/jira/browse/TOREE-344
>             Project: TOREE
>          Issue Type: Bug
>            Reporter: haniar
>            Priority: Major
>             Fix For: 0.2.0
>
>
> I have installed toree to my jupyter environment 
> (https://github.com/apache/incubator-toree) and written a piece of code that 
> works with pyspark. Yarn starts properly and I can see the containers running 
> in the queue,
> When I run the code, I get the following error
> Error from python worker:
>   /usr/local/bin/python2.7: No module named pyspark
> the kernel is set-up as follows:
> {
>   "language": "python",
>   "display_name": "Apache Toree - PySpark",
>   "env": {
>     "__TOREE_SPARK_OPTS__": " --master yarn",
>     "SPARK_HOME": "/usr/hdp/2.4.2.0-258/spark",
>     "__TOREE_OPTS__": "",
>     "DEFAULT_INTERPRETER": "PySpark",
>     "PYTHONPATH": 
> "/usr/hdp/2.4.2.0-258/spark/python:/usr/hdp/2.4.2.0-258/spark/python/lib/py4j-0.9-src.zip",
>     "PYTHON_EXEC": "python",
>  "PYTHONSTARTUP": "/usr/hdp/2.4.2.0-258/spark/python/pyspark/shell.py",
>  "PYSPARK_PYTHON": "/usr/local/bin/python2.7",
>        "PYSPARK_DRIVER_PYTHON": "/usr/local/bin/python2.7"
>   },
>   "argv": [
>     "/usr/local/share/jupyter/kernels/apache_toree_pyspark/bin/run.sh",
>     "--profile",
>     "{connection_file}"
>   ]
> }



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to