Hi,

I am trying to run Zeppelin remotely against my CDH 5.4.0 cluster (Spark 1.3.0). My Spark cluster is YARN-based.

How I built Zeppelin:

    git clone https://github.com/apache/incubator-zeppelin
    mvn clean package -Pspark-1.3 -Dhadoop.version=2.6.0-cdh5.4.0 -Phadoop-2.6 -Pyarn -DskipTests

When I run the following program in local mode, it runs perfectly fine:

    "master": "local[*]",

    %spark
    val textFile = sc.textFile("hdfs://master:8020/user/prateek/bigdata.csv", 1)
    textFile.count

But when I changed the Zeppelin master to yarn-cluster, it no longer works. I made the following changes in conf/zeppelin-env.sh:

    export MASTER=yarn-cluster
    export HADOOP_CONF_DIR=/home/ubuntu/hadoop/

Here are the error logs:

INFO [2015-06-04 15:51:56,559] ({WebSocketWorker-8} NotebookServer.java[onMessage]:100) - RECEIVE << RUN_PARAGRAPH
INFO [2015-06-04 15:51:56,566] ({WebSocketWorker-8} NotebookServer.java[broadcast]:251) - SEND >> NOTE
INFO [2015-06-04 15:51:56,568] ({WebSocketWorker-8} NotebookServer.java[broadcast]:251) - SEND >> NOTE
INFO [2015-06-04 15:51:56,571] ({pool-1-thread-2} SchedulerFactory.java[jobStarted]:132) - Job paragraph_1433456389119_-50569378 started by scheduler remoteinterpreter_347577382
INFO [2015-06-04 15:51:56,571] ({pool-1-thread-2} Paragraph.java[jobRun]:194) - run paragraph 20150604-151949_5322023 using spark org.apache.zeppelin.interpreter.LazyOpenInterpreter@756ba7d3
INFO [2015-06-04 15:51:56,575] ({pool-1-thread-2} Paragraph.java[jobRun]:211) - RUN : val textFile = sc.textFile("hdfs://master:8020/user/prateek/bigdata.csv", 1)
textFile.count
INFO [2015-06-04 15:51:56,676] ({Thread-35} NotebookServer.java[broadcast]:251) - SEND >> NOTE
ERROR [2015-06-04 15:51:57,949] ({pool-1-thread-2} Job.java[run]:183) - Job failed
org.apache.zeppelin.interpreter.InterpreterException: org.apache.thrift.TApplicationException: Internal error processing interpret
    at org.apache.zeppelin.interpreter.remote.RemoteInterpreter.interpret(RemoteInterpreter.java:221)
    at org.apache.zeppelin.interpreter.LazyOpenInterpreter.interpret(LazyOpenInterpreter.java:93)
    at org.apache.zeppelin.notebook.Paragraph.jobRun(Paragraph.java:212)
    at org.apache.zeppelin.scheduler.Job.run(Job.java:170)
    at org.apache.zeppelin.scheduler.RemoteScheduler$JobRunner.run(RemoteScheduler.java:296)
    at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471)
    at java.util.concurrent.FutureTask.run(FutureTask.java:262)
    at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$201(ScheduledThreadPoolExecutor.java:178)
    at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:292)
    at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
    at java.lang.Thread.run(Thread.java:745)
Caused by: org.apache.thrift.TApplicationException: Internal error processing interpret
    at org.apache.thrift.TApplicationException.read(TApplicationException.java:108)
    at org.apache.thrift.TServiceClient.receiveBase(TServiceClient.java:71)
    at org.apache.zeppelin.interpreter.thrift.RemoteInterpreterService$Client.recv_interpret(RemoteInterpreterService.java:190)
    at org.apache.zeppelin.interpreter.thrift.RemoteInterpreterService$Client.interpret(RemoteInterpreterService.java:175)
    at org.apache.zeppelin.interpreter.remote.RemoteInterpreter.interpret(RemoteInterpreter.java:204)
    ... 11 more
INFO [2015-06-04 15:51:57,951] ({Thread-35} NotebookServer.java[afterStatusChange]:571) - Job 20150604-151949_5322023 is finished
INFO [2015-06-04 15:51:57,957] ({Thread-35} NotebookServer.java[broadcast]:251) - SEND >> NOTE
INFO [2015-06-04 15:51:57,960] ({pool-1-thread-2} SchedulerFactory.java[jobFinished]:138) - Job paragraph_1433456389119_-50569378 finished by scheduler remoteinterpreter_347577382
ERROR [2015-06-04 15:51:59,992] ({Thread-36} JobProgressPoller.java[run]:57) - Can not get or update progress
org.apache.zeppelin.interpreter.InterpreterException: org.apache.thrift.TApplicationException: Internal error processing getProgress
    at org.apache.zeppelin.interpreter.remote.RemoteInterpreter.getProgress(RemoteInterpreter.java:286)
    at org.apache.zeppelin.interpreter.LazyOpenInterpreter.getProgress(LazyOpenInterpreter.java:110)
    at org.apache.zeppelin.notebook.Paragraph.progress(Paragraph.java:179)
    at org.apache.zeppelin.scheduler.JobProgressPoller.run(JobProgressPoller.java:54)
Caused by: org.apache.thrift.TApplicationException: Internal error processing getProgress
    at org.apache.thrift.TApplicationException.read(TApplicationException.java:108)
    at org.apache.thrift.TServiceClient.receiveBase(TServiceClient.java:71)
    at org.apache.zeppelin.interpreter.thrift.RemoteInterpreterService$Client.recv_getProgress(RemoteInterpreterService.java:235)
    at org.apache.zeppelin.interpreter.thrift.RemoteInterpreterService$Client.getProgress(RemoteInterpreterService.java:221)
    at org.apache.zeppelin.interpreter.remote.RemoteInterpreter.getProgress(RemoteInterpreter.java:284)
    ... 3 more

Please help me sort out the above error.

Regards,
Prateek
