Hello,
I am having trouble running a pyspark note (zeppelin newbie, could well be
pilot error).
The note is
%pyspark
print ‘hello world’
The note transitions to “PENDING” and then “RUNNING” but never finishes after
that.
From the zeppelin server logs:
INFO [2015-03-30 12:24:10,512] ({pool-2-thread-2}
RemoteInterpreterProcess.java[reference]:74) - Run interpreter process
/Users/rvenkatesh/dev/asf/zeppelin/bin/interpreter.sh -d
/Users/rvenkatesh/dev/asf/zeppelin/interpreter/spark -p 59569
INFO [2015-03-30 12:24:11,570] ({pool-2-thread-2}
RemoteInterpreter.java[init]:114) - Create remote interpreter
com.nflabs.zeppelin.spark.SparkInterpreter
INFO [2015-03-30 12:24:11,623] ({pool-2-thread-2}
RemoteInterpreter.java[init]:114) - Create remote interpreter
com.nflabs.zeppelin.spark.PySparkInterpreter
INFO [2015-03-30 12:24:11,628] ({pool-2-thread-2}
RemoteInterpreter.java[init]:114) - Create remote interpreter
com.nflabs.zeppelin.spark.SparkSqlInterpreter
INFO [2015-03-30 12:24:11,631] ({pool-2-thread-2}
RemoteInterpreter.java[init]:114) - Create remote interpreter
com.nflabs.zeppelin.spark.DepInterpreter
INFO [2015-03-30 12:24:11,635] ({pool-2-thread-2}
RemoteInterpreter.java[open]:143) - open remote interpreter
com.nflabs.zeppelin.spark.PySparkInterpreter
INFO [2015-03-30 12:24:11,682] ({pool-2-thread-2} Paragraph.java[jobRun]:182)
- RUN :
print 'hello world'
INFO [2015-03-30 12:24:19,444] ({Thread-24}
RemoteScheduler.java[getStatus]:185) - getStatus from remote RUNNING
INFO [2015-03-30 12:24:19,444] ({Thread-24}
NotebookServer.java[broadcast]:205) - SEND >> NOTE
INFO [2015-03-30 12:24:19,446] ({Thread-25}
NotebookServer.java[broadcast]:205) - SEND >> PROGRESS
INFO [2015-03-30 12:24:19,955] ({Thread-25}
NotebookServer.java[broadcast]:205) - SEND >> PROGRESS
… ad infinetum
Nothing interesting in the spark interpreter logs.
Any help appreciated.
Thanks!
Ram