I am trying the simple thing in pyspark: %pyspark rdd = sc.parallelize(["1","2","3"]) print(rdd.collect()) z.show(sqlContext.createDataFrame(rdd))
AND keep getting error: Traceback (most recent call last): File "/tmp/zeppelin_pyspark.py", line 116, in <module> eval(compiledCode) File "<string>", line 3, in <module> File "/home/cto/Downloads/incubator-zeppelin/interpreter/spark/pyspark/pyspark.zip/pyspark/sql/context.py", line 339, in createDataFrame _verify_type(row, schema) File "/home/cto/Downloads/incubator-zeppelin/interpreter/spark/pyspark/pyspark.zip/pyspark/sql/types.py", line 1013, in _verify_type % (dataType, type(obj))) TypeError: StructType(List(StructField(Number,StringType,true))) can not accept object in type <type 'str'> This show be easy... Eran