robinlin created ZEPPELIN-1413:
----------------------------------

             Summary: Pyspark, can not create data frame by using sqlContext
                 Key: ZEPPELIN-1413
                 URL: https://issues.apache.org/jira/browse/ZEPPELIN-1413
             Project: Zeppelin
          Issue Type: Bug
          Components: Core, zeppelin-interpreter
    Affects Versions: 0.6.1
         Environment: docker container, ubuntu 14.04, spark 2.0, python 2.7
            Reporter: robinlin


I got problem when I try to use sqlContext to create a data frame in Zeppelin 
notebook, but works fine in pysprak console.
{code:java}
l = [('alice', 1)]
sqlContext.createDataFrame(l).collect()
{code}
The Error message
{code:java}
Traceback (most recent call last):
  File "/tmp/zeppelin_pyspark-8877291249558119831.py", line 266, in <module>
    raise Exception(traceback.format_exc())
Exception: Traceback (most recent call last):
  File "/tmp/zeppelin_pyspark-8877291249558119831.py", line 264, in <module>
    exec(code)
  File "<stdin>", line 2, in <module>
  File "/opt/spark/python/pyspark/sql/context.py", line 299, in createDataFrame
    return self.sparkSession.createDataFrame(data, schema, samplingRatio)
  File "/opt/spark/python/lib/py4j-0.10.1-src.zip/py4j/java_gateway.py", line 
933, in __call__
    answer, self.gateway_client, self.target_id, self.name)
  File "/opt/spark/python/pyspark/sql/utils.py", line 63, in deco
    return f(*a, **kw)
  File "/opt/spark/python/lib/py4j-0.10.1-src.zip/py4j/protocol.py", line 316, 
in get_return_value
    format(target_id, ".", name, value))
Py4JError: An error occurred while calling o49.createDataFrame. Trace:
py4j.Py4JException: Method createDataFrame([class java.util.ArrayList, null, 
null]) does not exist
        at py4j.reflection.ReflectionEngine.getMethod(ReflectionEngine.java:318)
        at py4j.reflection.ReflectionEngine.getMethod(ReflectionEngine.java:326)
        at py4j.Gateway.invoke(Gateway.java:272)
        at py4j.commands.AbstractCommand.invokeMethod(AbstractCommand.java:128)
        at py4j.commands.CallCommand.execute(CallCommand.java:79)
        at py4j.GatewayConnection.run(GatewayConnection.java:211)
        at java.lang.Thread.run(Thread.java:745)
{code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to