boyan created ZEPPELIN-6152:
-------------------------------
Summary: Unable to execute Spark jobs after upgrade to Zeppelin
0.10.1 and Spark 3.3.3
Key: ZEPPELIN-6152
URL: https://issues.apache.org/jira/browse/ZEPPELIN-6152
Project: Zeppelin
Issue Type: Bug
Components: spark
Affects Versions: 0.10.1
Environment: Standalone Zeppelin instance running in a container on an
OpenShift cluster.
Used to trigger Spark jobs on a remote YARN cluster.
Reporter: boyan
Hello Zeppelin Team,
I hope you can help us identify what is causing this issue. We use
Zeppelin with the Spark interpreter on MapR (Hadoop). In our environment with
Zeppelin 0.9.0, Spark 2.4.4 and MapR 6 we have no problems executing simple
Spark commands (e.g. println("Hello World")) with the Spark interpreter.
However, while preparing for the upgrade to Zeppelin 0.10.1, Spark 3.3.3 and
MapR 7, we found a problem in our test environment: we can no longer run even a
simple Spark command. We always get this error:
INFO [2024-11-18 10:55:43,398] ({RegisterThread} RemoteInterpreterServer.java[run]:603) - Start registration
INFO [2024-11-18 10:55:43,512] ({RegisterThread} RemoteInterpreterServer.java[run]:617) - Registering interpreter process
ERROR [2024-11-18 10:55:43,519] ({RegisterThread} RemoteInterpreterServer.java[run]:621) - Error while registering interpreter: RegisterInfo(host:x.x.x.x, port:29037, interpreterGroupId:spark-shared_process), cause: {}
java.lang.RuntimeException
    at org.apache.zeppelin.interpreter.remote.PooledRemoteClient.callRemoteFunction(PooledRemoteClient.java:119)
    at org.apache.zeppelin.interpreter.remote.RemoteInterpreterEventClient.callRemoteFunction(RemoteInterpreterEventClient.java:80)
    at org.apache.zeppelin.interpreter.remote.RemoteInterpreterEventClient.registerInterpreterProcess(RemoteInterpreterEventClient.java:88)
    at org.apache.zeppelin.interpreter.remote.RemoteInterpreterServer$RegisterRunnable.run(RemoteInterpreterServer.java:618)
    at java.base/java.lang.Thread.run(Thread.java:829)
INFO [2024-11-18 10:55:43,520] ({RegisterThread} RemoteInterpreterServer.java[shutdown]:250) - Unregister interpreter process
ERROR [2024-11-18 10:55:43,526] ({RegisterThread} RemoteInterpreterServer.java[shutdown]:253) - Fail to unregister remote interpreter process
java.lang.RuntimeException
    at org.apache.zeppelin.interpreter.remote.PooledRemoteClient.callRemoteFunction(PooledRemoteClient.java:119)
    at org.apache.zeppelin.interpreter.remote.RemoteInterpreterEventClient.callRemoteFunction(RemoteInterpreterEventClient.java:80)
    at org.apache.zeppelin.interpreter.remote.RemoteInterpreterEventClient.unRegisterInterpreterProcess(RemoteInterpreterEventClient.java:95)
    at org.apache.zeppelin.interpreter.remote.RemoteInterpreterServer.shutdown(RemoteInterpreterServer.java:251)
    at org.apache.zeppelin.interpreter.remote.RemoteInterpreterServer$RegisterRunnable.run(RemoteInterpreterServer.java:623)
    at java.base/java.lang.Thread.run(Thread.java:829)
INFO [2024-11-18 10:55:43,526] ({RegisterThread} RemoteInterpreterServer.java[run]:640) - Registration finished
INFO [2024-11-18 10:55:43,526] ({ShutdownThread} RemoteInterpreterServer.java[run]:657) - Shutting down...
We trigger the Spark jobs on the YARN cluster and use the same
configuration as before. However, we noticed that the code Zeppelin uses to
launch the interpreter has changed substantially since the old version.
We have no problems running spark-submit jobs directly from the Spark
folder or with Zeppelin's spark-submit interpreter.
Thank you for your support!