Valery Khamenya created ZEPPELIN-3743:
-----------------------------------------

             Summary: java.lang.RuntimeException: Multiple py4j files found under /usr/lib/spark/python/lib
                 Key: ZEPPELIN-3743
                 URL: https://issues.apache.org/jira/browse/ZEPPELIN-3743
             Project: Zeppelin
          Issue Type: Bug
          Components: pySpark, python-interpreter
    Affects Versions: 0.8.0
         Environment: AWS Linux running EMR 5.16.0 

Zeppelin 0.8 is installed using the latest official full bundle.

 
            Reporter: Valery Khamenya


The following error is mistakenly reported:

{{java.lang.RuntimeException: Multiple py4j files found under /usr/lib/spark/python/lib}}

The error happens, e.g., during the execution of {{import plotly}}.

The content of {{/usr/lib/spark/python/lib}}:

{{$ ll /usr/lib/spark/python/lib}}
{{total 580}}
{{-rw-r--r-- 1 root root 42437 29. Jun 18:11 py4j-0.10.7-src.zip}}
{{-rw-r--r-- 1 root root 1445 29. Jun 18:11 PY4J_LICENSE.txt}}
{{lrwxrwxrwx 1 root root 19 20. Aug 14:01 py4j-src.zip -> py4j-0.10.7-src.zip}}
{{-rw-r--r-- 1 root root 543385 29. Jun 18:11 pyspark.zip}}

That is, the symlink {{py4j-src.zip}} is treated as a second py4j file, although it merely points to {{py4j-0.10.7-src.zip}}.

The line of code containing the bug:

[https://github.com/apache/zeppelin/blob/001c621c773db837b4c90bbf7868c94f284247ce/spark/interpreter/src/main/java/org/apache/zeppelin/spark/PythonUtils.java#L74]

If symlinks were resolved to the real path of their target file before 
counting, only one distinct file would remain.
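
A minimal sketch of that idea follows. It is not the code in PythonUtils.java: the class name {{Py4jLookup}}, the method {{findPy4j}}, and the glob {{py4j-*.zip}} are hypothetical and chosen only for illustration. Each matching entry is resolved with {{Path.toRealPath()}}, which follows symlinks by default, and the resolved paths are collected in a set so that a link and its target count once.

```java
import java.io.IOException;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical helper for illustration; not Zeppelin's actual implementation.
final class Py4jLookup {

    // Returns the single distinct py4j archive under libDir,
    // counting a symlink and its target as the same file.
    static Path findPy4j(Path libDir) throws IOException {
        Set<Path> targets = new TreeSet<>();
        // Assumed file name pattern; the real filter may differ.
        try (DirectoryStream<Path> entries =
                 Files.newDirectoryStream(libDir, "py4j-*.zip")) {
            for (Path entry : entries) {
                // toRealPath() follows symlinks unless NOFOLLOW_LINKS is passed,
                // so py4j-src.zip collapses onto py4j-0.10.7-src.zip.
                targets.add(entry.toRealPath());
            }
        }
        if (targets.isEmpty()) {
            throw new RuntimeException("No py4j files found under " + libDir);
        }
        if (targets.size() > 1) {
            throw new RuntimeException("Multiple py4j files found under " + libDir);
        }
        return targets.iterator().next();
    }
}
```

With the directory listing above, this sketch would return the path of {{py4j-0.10.7-src.zip}} instead of throwing. Note that {{toRealPath()}} throws an {{IOException}} for a dangling symlink, so a real fix might want to skip or report such entries explicitly.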

 

 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
