Shafique Jamal created ZEPPELIN-2869: ----------------------------------------
Summary: .take(1) method on Dataset[String] fails due to inability to deserialize (NoSuchMethodError) Key: ZEPPELIN-2869 URL: https://issues.apache.org/jira/browse/ZEPPELIN-2869 Project: Zeppelin Issue Type: Bug Components: zeppelin-interpreter Affects Versions: 0.7.2 Environment: Mac OS X Reporter: Shafique Jamal Running the following command fails (the first line succeeds, the second line fails): {{val yelpdata = spark.read.textFile("s3a://sparkcookbook/yelpdata") yelpdata.take(1)}} with the following error: {{yelpdata: org.apache.spark.sql.Dataset[String] = [value: string] java.lang.NoClassDefFoundError: Could not initialize class org.apache.spark.rdd.RDDOperationScope$ at org.apache.spark.sql.execution.SparkPlan.executeQuery(SparkPlan.scala:132) at org.apache.spark.sql.execution.SparkPlan.execute(SparkPlan.scala:113) at org.apache.spark.sql.execution.SparkPlan.getByteArrayRdd(SparkPlan.scala:225) at org.apache.spark.sql.execution.SparkPlan.executeTake(SparkPlan.scala:308) at org.apache.spark.sql.execution.CollectLimitExec.executeCollect(limit.scala:38) at org.apache.spark.sql.Dataset$$anonfun$org$apache$spark$sql$Dataset$$execute$1$1.apply(Dataset.scala:2371) at org.apache.spark.sql.execution.SQLExecution$.withNewExecutionId(SQLExecution.scala:57) at org.apache.spark.sql.Dataset.withNewExecutionId(Dataset.scala:2765) at org.apache.spark.sql.Dataset.org$apache$spark$sql$Dataset$$execute$1(Dataset.scala:2370) at org.apache.spark.sql.Dataset.org$apache$spark$sql$Dataset$$collect(Dataset.scala:2377) at org.apache.spark.sql.Dataset$$anonfun$head$1.apply(Dataset.scala:2113) at org.apache.spark.sql.Dataset$$anonfun$head$1.apply(Dataset.scala:2112) at org.apache.spark.sql.Dataset.withTypedCallback(Dataset.scala:2795) at org.apache.spark.sql.Dataset.head(Dataset.scala:2112) at org.apache.spark.sql.Dataset.take(Dataset.scala:2327)}} -- This message was sent by Atlassian JIRA (v6.4.14#64029)