[
https://issues.apache.org/jira/browse/BIGTOP-1536?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14304946#comment-14304946
]
jay vyas commented on BIGTOP-1536:
----------------------------------
[~rnowling] Okay, I tried using your LoadData method and got the same error.
{noformat}
15/02/04 05:51:06 INFO rdd.HadoopRDD: Input split: file:/var/folders/dh/m4sz1bc971d_qkhz8yxmqz8m0000gn/T/sparkDriverSuiteGeneratedData6619154423917988266/transactions/part-00000:0+5706813
15/02/04 05:51:06 INFO rdd.HadoopRDD: Input split: file:/var/folders/dh/m4sz1bc971d_qkhz8yxmqz8m0000gn/T/sparkDriverSuiteGeneratedData6619154423917988266/transactions/part-00001:0+5713106
15/02/04 05:51:06 ERROR executor.Executor: Exception in task 0.0 in stage 3.0 (TID 6)
java.io.IOException: file:/var/folders/dh/m4sz1bc971d_qkhz8yxmqz8m0000gn/T/sparkDriverSuiteGeneratedData6619154423917988266/transactions/part-00000 not a SequenceFile
at org.apache.hadoop.io.SequenceFile$Reader.init(SequenceFile.java:1513)
at org.apache.hadoop.io.SequenceFile$Reader.<init>(SequenceFile.java:1486)
at org.apache.hadoop.io.SequenceFile$Reader.<init>(SequenceFile.java:1475)
at org.apache.hadoop.io.SequenceFile$Reader.<init>(SequenceFile.java:1470)
at org.apache.hadoop.mapred.SequenceFileRecordReader.<init>(SequenceFileRecordReader.java:43)
at org.apache.hadoop.mapred.SequenceFileInputFormat.getRecordReader(SequenceFileInputFormat.java:59)
at org.apache.spark.rdd.HadoopRDD$$anon$1.<init>(HadoopRDD.scala:197)
at org.apache.spark.rdd.HadoopRDD.compute(HadoopRDD.scala:188)
at org.apache.spark.rdd.HadoopRDD.compute(HadoopRDD.scala:97)
at org.apache.spark.rdd.RDD.computeOrReadCheckpoint(RDD.scala:262)
at org.apache.spark.rdd.RDD.iterator(RDD.scala:229)
at org.apache.spark.rdd.FlatMappedRDD.compute(FlatMappedRDD.scala:33)
at org.apache.spark.rdd.RDD.computeOrReadCheckpoint(RDD.scala:262)
at org.apache.spark.rdd.RDD.iterator(RDD.scala:229)
at org.apache.spark.scheduler.ResultTask.runTask(ResultTask.scala:62)
at org.apache.spark.scheduler.Task.run(Task.scala:54)
at org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:177)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
at java.lang.Thread.run(Thread.java:745)
15/02/04 05:51:06 ERROR executor.Executor: Exception in task 1.0 in stage 3.0 (TID 7)
{noformat}
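For context on the exception above: Hadoop's SequenceFile.Reader reports "not a SequenceFile" when a file does not begin with the magic bytes 'S', 'E', 'Q'. Below is a minimal Java sketch for checking the header of a local part file. The class and method names (SeqHeaderCheck, hasSequenceFileMagic) are hypothetical and are not part of Hadoop, Spark, or the attached patches. The sketch only inspects the first three bytes and does not validate the rest of the file.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Paths;

// Hypothetical helper, not part of Hadoop or Spark.
final class SeqHeaderCheck {
    // SequenceFiles start with the ASCII bytes 'S', 'E', 'Q', followed by a version byte.
    private static final byte[] MAGIC = {'S', 'E', 'Q'};

    // Returns true if the given header bytes start with the SequenceFile magic.
    static boolean hasSequenceFileMagic(byte[] header) {
        if (header == null || header.length < MAGIC.length) {
            return false;
        }
        for (int i = 0; i < MAGIC.length; i++) {
            if (header[i] != MAGIC[i]) {
                return false;
            }
        }
        return true;
    }

    // Reads the first three bytes of a local file and checks them against the magic.
    static boolean hasSequenceFileMagic(String localPath) throws IOException {
        byte[] header = new byte[MAGIC.length];
        int off = 0;
        try (InputStream in = Files.newInputStream(Paths.get(localPath))) {
            while (off < header.length) {
                int n = in.read(header, off, header.length - off);
                if (n < 0) {
                    break; // file is shorter than the magic
                }
                off += n;
            }
        }
        return off == header.length && hasSequenceFileMagic(header);
    }

    public static void main(String[] args) throws IOException {
        // Pass local paths (without the "file:" scheme prefix) as arguments.
        for (String path : args) {
            System.out.println(path + " -> " + hasSequenceFileMagic(path));
        }
    }
}
```

If the check returns false for transactions/part-00000, the file was probably written in some other format (plain text, for example), although the log above does not by itself confirm which one.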
> Add Basic Sales Analytics Example to BPS Spark
> ----------------------------------------------
>
> Key: BIGTOP-1536
> URL: https://issues.apache.org/jira/browse/BIGTOP-1536
> Project: Bigtop
> Issue Type: Improvement
> Components: blueprints
> Reporter: RJ Nowling
> Assignee: RJ Nowling
> Attachments: BIGTOP-1536.patch, firstpass.patch
>
>
> Using the Spark data generator and ETL script (BIGTOP-1535), add a simple
> Spark sales analytics example that computes basic stats such as:
> * Number of sales per category per month or quarter
> * Top selling items in each category per month or quarter
--
This message was sent by Atlassian JIRA (v6.3.4#6332)