[ 
https://issues.apache.org/jira/browse/BIGTOP-1536?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14304946#comment-14304946
 ] 

jay vyas commented on BIGTOP-1536:
----------------------------------

[~rnowling] okay, I tried using your LoadData method, same error.

{noformat}
15/02/04 05:51:06 INFO rdd.HadoopRDD: Input split: 
file:/var/folders/dh/m4sz1bc971d_qkhz8yxmqz8m0000gn/T/sparkDriverSuiteGeneratedData6619154423917988266/transactions/part-00000:0+5706813
15/02/04 05:51:06 INFO rdd.HadoopRDD: Input split: 
file:/var/folders/dh/m4sz1bc971d_qkhz8yxmqz8m0000gn/T/sparkDriverSuiteGeneratedData6619154423917988266/transactions/part-00001:0+5713106
15/02/04 05:51:06 ERROR executor.Executor: Exception in task 0.0 in stage 3.0 
(TID 6)
java.io.IOException: 
file:/var/folders/dh/m4sz1bc971d_qkhz8yxmqz8m0000gn/T/sparkDriverSuiteGeneratedData6619154423917988266/transactions/part-00000
 not a SequenceFile
        at org.apache.hadoop.io.SequenceFile$Reader.init(SequenceFile.java:1513)
        at 
org.apache.hadoop.io.SequenceFile$Reader.<init>(SequenceFile.java:1486)
        at 
org.apache.hadoop.io.SequenceFile$Reader.<init>(SequenceFile.java:1475)
        at 
org.apache.hadoop.io.SequenceFile$Reader.<init>(SequenceFile.java:1470)
        at 
org.apache.hadoop.mapred.SequenceFileRecordReader.<init>(SequenceFileRecordReader.java:43)
        at 
org.apache.hadoop.mapred.SequenceFileInputFormat.getRecordReader(SequenceFileInputFormat.java:59)
        at org.apache.spark.rdd.HadoopRDD$$anon$1.<init>(HadoopRDD.scala:197)
        at org.apache.spark.rdd.HadoopRDD.compute(HadoopRDD.scala:188)
        at org.apache.spark.rdd.HadoopRDD.compute(HadoopRDD.scala:97)
        at org.apache.spark.rdd.RDD.computeOrReadCheckpoint(RDD.scala:262)
        at org.apache.spark.rdd.RDD.iterator(RDD.scala:229)
        at org.apache.spark.rdd.FlatMappedRDD.compute(FlatMappedRDD.scala:33)
        at org.apache.spark.rdd.RDD.computeOrReadCheckpoint(RDD.scala:262)
        at org.apache.spark.rdd.RDD.iterator(RDD.scala:229)
        at org.apache.spark.scheduler.ResultTask.runTask(ResultTask.scala:62)
        at org.apache.spark.scheduler.Task.run(Task.scala:54)
        at org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:177)
        at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
        at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
        at java.lang.Thread.run(Thread.java:745)
15/02/04 05:51:06 ERROR executor.Executor: Exception in task 1.0 in stage 3.0 
(TID 7)
{noformat}

> Add Basic Sales Analytics Example to BPS Spark
> ----------------------------------------------
>
>                 Key: BIGTOP-1536
>                 URL: https://issues.apache.org/jira/browse/BIGTOP-1536
>             Project: Bigtop
>          Issue Type: Improvement
>          Components: blueprints
>            Reporter: RJ Nowling
>            Assignee: RJ Nowling
>         Attachments: BIGTOP-1536.patch, firstpass.patch
>
>
> Using the Spark data generator and ETL script (BIGTOP-1535), add a simple 
> Spark sales analytics example that computes basic stats such as:
> * Number of sales per category per month or quarter
> * Top selling items in each category per month or quarter



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to