I really cannot figure out what this is about.. (tried to import pandas, in case that is a dependency, but it didn't help.)
>>> from pyspark.sql import SQLContext >>> sqlContext=SQLContext(sc) >>> sqlContext.createDataFrame(l).collect() Traceback (most recent call last): File "<stdin>", line 1, in <module> File "/opt/cloudera/parcels/CDH-5.4.2-1.cdh5.4.2.p0.2/lib/spark/python/pyspark/sql/context.py", line 318, in createDataFrame if has_pandas and isinstance(data, pandas.DataFrame): AttributeError: 'module' object has no attribute 'DataFrame' Would appreciate any pointers. Thanks! Ping