onjee created ZEPPELIN-6030:
-------------------------------
Summary: Bug Report: Apache Zeppelin 0.11.1 Issue with Spark
Interpreter
Key: ZEPPELIN-6030
URL: https://issues.apache.org/jira/browse/ZEPPELIN-6030
Project: Zeppelin
Issue Type: Bug
Components: build
Affects Versions: 0.11.1
Environment: *Environment:*
* Apache Zeppelin version: 0.11.1
* Operating System: Ubuntu 24.04 LTS
* Java version: OpenJDK 8
* Spark version: 2.4.7
* Hadoop: Fully distributed mode
* YARN: Fully distributed mode
* Zeppelin server and Spark running on different servers
* Spark configured to use YARN in cluster mode
Reporter: onjee
Fix For: 0.11.1
*Description:*
In Apache Zeppelin 0.11.1, the Spark interpreter fails to initialize. The cause
appears to be unexpected metadata files in the distribution, which were likely
included when the release was built on a macOS system. These files cause errors
while the Spark interpreter initializes, so it cannot run.
This may indicate a problem in the release build.
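To check whether the downloaded release archive itself contains such files, a minimal sketch along the following lines could be used. It assumes the distribution is a tar archive; the function name and the example archive path in the note are hypothetical and not taken from this report.

```python
import tarfile
from pathlib import PurePosixPath


def find_appledouble_members(archive_path):
    """Return names of archive members whose file name starts with '._'."""
    # tarfile.open() detects the compression (e.g. gzip) automatically.
    with tarfile.open(archive_path) as tar:
        return [
            name
            for name in tar.getnames()
            # Tar member names use '/' separators, so PurePosixPath is used
            # to extract the last path component on any platform.
            if PurePosixPath(name).name.startswith("._")
        ]
```

For example, calling `find_appledouble_members("zeppelin-0.11.1-bin-all.tgz")` (an assumed file name) would list any `._*` entries packed into that archive; an empty list would suggest the files were introduced somewhere else.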
*Steps to Reproduce:*
# Set up Apache Zeppelin version 0.11.1.
# Configure the Spark interpreter.
# Attempt to run a Spark paragraph.
*Expected Behavior:* The Spark paragraph should execute without any errors.
*Actual Behavior:* The Spark interpreter fails to initialize. In the error
below, the file being opened as a JAR has a name starting with ._ and is not a
JAR file at all; it is a macOS metadata file. The following error messages are
logged:
{code}
Caused by: java.io.IOException: Error accessing /tmp/hadoop-on5949/nm-local-dir/usercache/on5949/appcache/application_1719905050899_0006/container_1719905050899_0006_01_000001/._spark-scala-2.12-0.11.1.jar
    at scala.reflect.io.FileZipArchive.scala$reflect$io$FileZipArchive$$openZipFile(ZipArchive.scala:190)
    at scala.reflect.io.FileZipArchive.root$lzycompute(ZipArchive.scala:238)
    at scala.reflect.io.FileZipArchive.root(ZipArchive.scala:235)
    at scala.reflect.io.FileZipArchive.allDirs$lzycompute(ZipArchive.scala:272)
    at scala.reflect.io.FileZipArchive.allDirs(ZipArchive.scala:272)
    at scala.tools.nsc.classpath.ZipArchiveFileLookup.findDirEntry(ZipArchiveFileLookup.scala:76)
    at scala.tools.nsc.classpath.ZipArchiveFileLookup.list(ZipArchiveFileLookup.scala:63)
    at scala.tools.nsc.classpath.ZipArchiveFileLookup.list$(ZipArchiveFileLookup.scala:62)
    at scala.tools.nsc.classpath.ZipAndJarClassPathFactory$ZipArchiveClassPath.list(ZipAndJarFileLookupFactory.scala:58)
    at scala.tools.nsc.classpath.AggregateClassPath.$anonfun$list$3(AggregateClassPath.scala:105)
    ... 40 more
Caused by: java.util.zip.ZipException: error in opening zip file
    at java.util.zip.ZipFile.open(Native Method)
    at java.util.zip.ZipFile.<init>(ZipFile.java:231)
    at java.util.zip.ZipFile.<init>(ZipFile.java:157)
    at java.util.zip.ZipFile.<init>(ZipFile.java:171)
    at scala.reflect.io.FileZipArchive.scala$reflect$io$FileZipArchive$$openZipFile(ZipArchive.scala:187)
    ... 49 more
{code}
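The ZipException above means the file has a .jar name but is not a readable zip archive. As a rough way to spot such files in a directory, the sketch below uses Python's standard `zipfile.is_zipfile`. The function name `invalid_jars` is made up for illustration, and the directory to scan is whatever location holds the interpreter JARs in a given installation.

```python
import zipfile
from pathlib import Path


def invalid_jars(directory):
    """Return sorted paths of '*.jar' files in `directory` that are not zip archives."""
    return sorted(
        p
        for p in Path(directory).iterdir()
        # Only regular files with a .jar name are considered; dot-prefixed
        # names such as '._foo.jar' are included on purpose.
        if p.is_file() and p.name.endswith(".jar") and not zipfile.is_zipfile(p)
    )
```

A file such as ._spark-scala-2.12-0.11.1.jar would be expected to show up in the result, while the real spark-scala-2.12-0.11.1.jar would not.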
*Additional Information:*
* The issue is caused by macOS metadata files, whose names start with '._',
being included in the Spark interpreter directory. (I don't believe these files
were present in the previous version, so they were probably included by
mistake.)
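One possible workaround, not confirmed in this report, is to delete the '._' files from the interpreter directory before starting the interpreter. The sketch below lists them first and deletes only when asked to. The function name is hypothetical, and the directory passed in (for example an `interpreter/spark` folder under the Zeppelin installation) is an assumption about the local layout.

```python
from pathlib import Path


def remove_appledouble_files(root, dry_run=True):
    """Find files named '._*' anywhere under `root`.

    With dry_run=True (the default) nothing is deleted; the matching
    paths are only returned so they can be reviewed first.
    """
    found = sorted(p for p in Path(root).rglob("._*") if p.is_file())
    if not dry_run:
        for p in found:
            p.unlink()  # delete the metadata file; real JARs are untouched
    return found
```

Running it once with the default `dry_run=True` and reviewing the returned paths before calling it again with `dry_run=False` keeps the deletion step deliberate.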
*References:*
* [Stack Overflow discussion on similar
issue|https://stackoverflow.com/questions/62954023/interpreterexception-in-apache-zeppelin-v-0-11-1-executing-a-paragraph]
* [My Velog post detailing the bug and
solution (Korean)|https://velog.io/@on5949/%ED%99%95%EC%8B%A4%ED%95%98%EC%A7%80-%EC%95%8A%EC%9D%8C-zeppelin-0.11.1-%EB%B2%84%EA%B7%B8-%EB%A6%AC%ED%8F%AC%ED%8A%B8]
* [Where I downloaded Zeppelin|https://zeppelin.apache.org/download.html]
--
This message was sent by Atlassian Jira
(v8.20.10#820010)