Neeraj created ZEPPELIN-168:
-------------------------------
Summary: loading parquet files fails
Key: ZEPPELIN-168
URL: https://issues.apache.org/jira/browse/ZEPPELIN-168
Project: Zeppelin
Issue Type: Bug
Reporter: Neeraj
When trying to read parquet files I get an NullPointerPointerException. Full
stack trace below.
Works when I try reading the parquet file from spark-shell.
val data = sqlContext.parquetFile(parquetTable)
I built zeppelin using the following :
mvn clean package -Pspark-1.3 -Dhadoop.version=2.5.0-cdh5.3.0 -Phadoop-2.4
-DskipTests
My guess right now that this could be a parquet jars version issue.
The logs do not show any error. The spark interpreter logs also do not show any
error for this particular line but have the following error when the spark
interpreter is starting
java.lang.ClassNotFoundException:
org.apache.hadoop.yarn.server.webproxy.amfilter.AmIpFilter
scala.collection.parallel.CompositeThrowable: Multiple exceptions thrown during
a parallel computation: java.lang.NullPointerException
.
.
.
at scala.collection.parallel.package$$anon$1.alongWith(package.scala:85)
at scala.collection.parallel.Task$class.mergeThrowables(Tasks.scala:86)
at
scala.collection.parallel.mutable.ParArray$Map.mergeThrowables(ParArray.scala:650)
at scala.collection.parallel.Task$class.tryMerge(Tasks.scala:72)
at
scala.collection.parallel.mutable.ParArray$Map.tryMerge(ParArray.scala:650)
at
scala.collection.parallel.AdaptiveWorkStealingTasks$WrappedTask$class.internal(Tasks.scala:190)
at
scala.collection.parallel.AdaptiveWorkStealingForkJoinTasks$WrappedTask.internal(Tasks.scala:514)
at
scala.collection.parallel.AdaptiveWorkStealingTasks$WrappedTask$class.compute(Tasks.scala:162)
at
scala.collection.parallel.AdaptiveWorkStealingForkJoinTasks$WrappedTask.compute(Tasks.scala:514)
at
scala.concurrent.forkjoin.RecursiveAction.exec(RecursiveAction.java:160)
at scala.concurrent.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260)
at
scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1339)
at
scala.concurrent.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979)
at
scala.concurrent.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107)
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)