Hi

I am trying to run  zeppelin remotely  with my CDH5.4.0  ( spark1.3.0 ) and
my spark cluster  is  yarn based.

how i build zeppelin :

-> git clone https://github.com/apache/incubator-zeppelin
-> mvn clean package -Pspark-1.3 -Dhadoop.version=2.6.0-cdh5.4.0
-Phadoop-2.6  -Pyarn -DskipTests


when i run following program using  local mode then its run perfectly fine.
"master": "local[*]",

%spark
val textFile = sc.textFile("hdfs://master:8020/user/prateek/bigdata.csv", 1)
textFile.count


but when i changed zeppelin master to yarn-cluster it doesnt work.

done following changes in "conf/zeppelin-env.sh"

export MASTER=yarn-cluster
export HADOOP_CONF_DIR=/home/ubuntu/hadoop/

here is  error logs:
INFO [2015-06-04 15:51:56,559] ({WebSocketWorker-8}
NotebookServer.java[onMessage]:100) - RECEIVE << RUN_PARAGRAPH
 INFO [2015-06-04 15:51:56,566] ({WebSocketWorker-8}
NotebookServer.java[broadcast]:251) - SEND >> NOTE
 INFO [2015-06-04 15:51:56,568] ({WebSocketWorker-8}
NotebookServer.java[broadcast]:251) - SEND >> NOTE
 INFO [2015-06-04 15:51:56,571] ({pool-1-thread-2}
SchedulerFactory.java[jobStarted]:132) - Job
paragraph_1433456389119_-50569378 started by scheduler
remoteinterpreter_347577382
 INFO [2015-06-04 15:51:56,571] ({pool-1-thread-2}
Paragraph.java[jobRun]:194) - run paragraph 20150604-151949_5322023 using
spark org.apache.zeppelin.interpreter.LazyOpenInterpreter@756ba7d3
 INFO [2015-06-04 15:51:56,575] ({pool-1-thread-2}
Paragraph.java[jobRun]:211) - RUN : val textFile =
sc.textFile("hdfs://master:8020/user/prateek/bigdata.csv", 1)
textFile.count
 INFO [2015-06-04 15:51:56,676] ({Thread-35}
NotebookServer.java[broadcast]:251) - SEND >> NOTE
ERROR [2015-06-04 15:51:57,949] ({pool-1-thread-2} Job.java[run]:183) - Job
failed
org.apache.zeppelin.interpreter.InterpreterException:
org.apache.thrift.TApplicationException: Internal error processing interpret
        at
org.apache.zeppelin.interpreter.remote.RemoteInterpreter.interpret(RemoteInterpreter.java:221)
        at
org.apache.zeppelin.interpreter.LazyOpenInterpreter.interpret(LazyOpenInterpreter.java:93)
        at org.apache.zeppelin.notebook.Paragraph.jobRun(Paragraph.java:212)
        at org.apache.zeppelin.scheduler.Job.run(Job.java:170)
        at
org.apache.zeppelin.scheduler.RemoteScheduler$JobRunner.run(RemoteScheduler.java:296)
        at
java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471)
        at java.util.concurrent.FutureTask.run(FutureTask.java:262)
        at
java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$201(ScheduledThreadPoolExecutor.java:178)
        at
java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:292)
        at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
        at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
        at java.lang.Thread.run(Thread.java:745)
Caused by: org.apache.thrift.TApplicationException: Internal error
processing interpret
        at
org.apache.thrift.TApplicationException.read(TApplicationException.java:108)
        at
org.apache.thrift.TServiceClient.receiveBase(TServiceClient.java:71)
        at
org.apache.zeppelin.interpreter.thrift.RemoteInterpreterService$Client.recv_interpret(RemoteInterpreterService.java:190)
        at
org.apache.zeppelin.interpreter.thrift.RemoteInterpreterService$Client.interpret(RemoteInterpreterService.java:175)
        at
org.apache.zeppelin.interpreter.remote.RemoteInterpreter.interpret(RemoteInterpreter.java:204)
        ... 11 more
 INFO [2015-06-04 15:51:57,951] ({Thread-35}
NotebookServer.java[afterStatusChange]:571) - Job 20150604-151949_5322023
is finished
 INFO [2015-06-04 15:51:57,957] ({Thread-35}
NotebookServer.java[broadcast]:251) - SEND >> NOTE
 INFO [2015-06-04 15:51:57,960] ({pool-1-thread-2}
SchedulerFactory.java[jobFinished]:138) - Job
paragraph_1433456389119_-50569378 finished by scheduler
remoteinterpreter_347577382
ERROR [2015-06-04 15:51:59,992] ({Thread-36}
JobProgressPoller.java[run]:57) - Can not get or update progress
org.apache.zeppelin.interpreter.InterpreterException:
org.apache.thrift.TApplicationException: Internal error processing
getProgress
        at
org.apache.zeppelin.interpreter.remote.RemoteInterpreter.getProgress(RemoteInterpreter.java:286)
        at
org.apache.zeppelin.interpreter.LazyOpenInterpreter.getProgress(LazyOpenInterpreter.java:110)
        at
org.apache.zeppelin.notebook.Paragraph.progress(Paragraph.java:179)
        at
org.apache.zeppelin.scheduler.JobProgressPoller.run(JobProgressPoller.java:54)
Caused by: org.apache.thrift.TApplicationException: Internal error
processing getProgress
        at
org.apache.thrift.TApplicationException.read(TApplicationException.java:108)
        at
org.apache.thrift.TServiceClient.receiveBase(TServiceClient.java:71)
        at
org.apache.zeppelin.interpreter.thrift.RemoteInterpreterService$Client.recv_getProgress(RemoteInterpreterService.java:235)
        at
org.apache.zeppelin.interpreter.thrift.RemoteInterpreterService$Client.getProgress(RemoteInterpreterService.java:221)
        at
org.apache.zeppelin.interpreter.remote.RemoteInterpreter.getProgress(RemoteInterpreter.java:284)
        ... 3 more


please help me to short out above error.

Regards
Prateek

Reply via email to