[jira] [Commented] (ZEPPELIN-24) Exception when reading files: textFile / parquetFile

Jongyoul Lee (JIRA) Thu, 02 Apr 2015 08:10:59 -0700

    [ 
https://issues.apache.org/jira/browse/ZEPPELIN-24?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14392806#comment-14392806
 ]


Jongyoul Lee commented on ZEPPELIN-24:
--------------------------------------

Your environments looks good without error, but in common case, this error is 
about the version of sparks between Zeppelin and Spark of your cluster. Do you 
perhaps use Spark cluster with CDH-versioned? If not, you should check your 
master and worker logs. or simply, try {{file:///data/bank-full.csv}}

> Exception when reading files: textFile / parquetFile
> ----------------------------------------------------
>
>                 Key: ZEPPELIN-24
>                 URL: https://issues.apache.org/jira/browse/ZEPPELIN-24
>             Project: Zeppelin
>          Issue Type: Bug
>          Components: Core
>    Affects Versions: 0.5.0
>            Reporter: Sebastian YEPES FERNANDEZ
>
> Hello,
> I have just encountered the following issue when running the last version 
> #b6768c, Has anyone encountered this issue as well? 
> Build options:
> -Phadoop-2.4 -Dhadoop.version=2.4.0 -Pspark-1.3 -Dspark.version=1.3.0
> {code:title=%Spark|borderStyle=solid}
> val bankText = sc.textFile("/data/bank-full.csv")
> bankText: org.apache.spark.rdd.RDD[String] = /data/bank-full.csv 
> MapPartitionsRDD[1] at textFile at <console>:23
> bankText.count
> org.apache.spark.SparkException: Job aborted due to stage failure: Task 0 in 
> stage 0.0 failed 4 times, most recent failure: Lost task 0.3 in stage 0.0 
> (TID 7, n1): ExecutorLostFailure (executor 3 lost)
> Driver stacktrace:
>       at 
> org.apache.spark.scheduler.DAGScheduler.org$apache$spark$scheduler$DAGScheduler$$failJobAndIndependentStages(DAGScheduler.scala:1203)
>       at 
> org.apache.spark.scheduler.DAGScheduler$$anonfun$abortStage$1.apply(DAGScheduler.scala:1192)
>       at 
> org.apache.spark.scheduler.DAGScheduler$$anonfun$abortStage$1.apply(DAGScheduler.scala:1191)
>       at 
> scala.collection.mutable.ResizableArray$class.foreach(ResizableArray.scala:59)
>       at scala.collection.mutable.ArrayBuffer.foreach(ArrayBuffer.scala:47)
>       at 
> org.apache.spark.scheduler.DAGScheduler.abortStage(DAGScheduler.scala:1191)
>       at 
> org.apache.spark.scheduler.DAGScheduler$$anonfun$handleTaskSetFailed$1.apply(DAGScheduler.scala:693)
>       at 
> org.apache.spark.scheduler.DAGScheduler$$anonfun$handleTaskSetFailed$1.apply(DAGScheduler.scala:693)
>       at scala.Option.foreach(Option.scala:236)
>       at 
> org.apache.spark.scheduler.DAGScheduler.handleTaskSetFailed(DAGScheduler.scala:693)
>       at 
> org.apache.spark.scheduler.DAGSchedulerEventProcessLoop.onReceive(DAGScheduler.scala:1393)
>       at 
> org.apache.spark.scheduler.DAGSchedulerEventProcessLoop.onReceive(DAGScheduler.scala:1354)
>       at org.apache.spark.util.EventLoop$$anon$1.run(EventLoop.scala:48)
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (ZEPPELIN-24) Exception when reading files: textFile / parquetFile

Reply via email to