We're trying to filter some records out of the output and write them to
another table in ORC, and the job takes twice as long. Is there a
better way to do this?
Here's the code:
jsonRows.foreachRDD(r => {
  val jsonDf = sqlSession.read.schema(sparrowSchema.schema).json(r)
  val cnsDf = sql
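One likely cause of the doubled runtime is that writing two outputs re-parses the JSON once per output. A minimal sketch of the pattern (the filter predicate and output paths below are illustrative assumptions, not from the thread) caches the parsed DataFrame so both writes reuse it:

```scala
// Sketch only: assumes sqlSession, sparrowSchema, and the ORC output
// locations already exist in your job. Caching the parsed DataFrame
// means the JSON is parsed once, not once per output table.
jsonRows.foreachRDD { r =>
  val jsonDf = sqlSession.read.schema(sparrowSchema.schema).json(r)
  jsonDf.cache()

  // Hypothetical predicate: replace with your real filter condition.
  val kept     = jsonDf.filter("status = 200")
  val filtered = jsonDf.filter("status <> 200")

  kept.write.mode("append").orc("s3://bucket/table/main")       // path is illustrative
  filtered.write.mode("append").orc("s3://bucket/table/filtered")

  jsonDf.unpersist()
}
```

Without `cache()`, each `write` triggers a separate pass over the source RDD, which matches the "twice as long" symptom.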
"com.amazonaws" % "aws-java-sdk-core" % "1.11.155"
Not sure if I need special configuration?
On Tue, 25 Jul 2017 at 04:17 周康 wrote:
> Ensure com.amazonaws.services.s3.AmazonS3ClientBuilder is in your classpath,
> which includes your application jar and attached execut
I have a Spark job that uses the S3 client in mapPartitions, and I get
this error:
Job aborted due to stage failure: Task 0 in stage 3.0 failed 4 times, most
recent failure: Lost task 0.3 in stage 3.0 (TID 74,
ip-10-90-78-177.ec2.internal, executor 11): java.lang.NoClassDefFoundError:
Could not
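A `NoClassDefFoundError` on an executor usually means the AWS SDK is on the driver's classpath but was never shipped to the executors. A sketch of two common fixes (the main class and jar names below are hypothetical; the SDK version is the one quoted in the thread):

```shell
# Option 1: let Spark distribute the dependency to every executor.
# aws-java-sdk-s3 pulls in aws-java-sdk-core transitively.
spark-submit \
  --packages com.amazonaws:aws-java-sdk-s3:1.11.155 \
  --class com.example.MyJob \
  myjob.jar

# Option 2: build a fat/assembly jar (e.g. with the sbt-assembly plugin)
# so the SDK classes live inside the application jar itself, then submit
# that jar without --packages.
```

Either way, the point is that code running inside `mapPartitions` executes on the executors, so the SDK classes must be present there, not just where the driver runs.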
I'm super new to Spark and I'm writing a job to parse nginx logs into the ORC
file format so they can be read from Presto. We wrote LogLine2Json, which
parses a line of nginx log into JSON, and that works fine.
val sqs = streamContext.receiverStream(new SQSReceiver("elb")
//.credentials("key", "se
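The log-to-ORC pipeline described above can be sketched as follows. This is an assumption-laden sketch: `LogLine2Json` is the poster's own class, so its constructor and `toJson` method here are hypothetical stand-ins for whatever its real API is, and the output path is illustrative:

```scala
// Sketch: logLines is a DStream[String] of raw nginx log lines (e.g. from
// the SQS receiver above), and sqlSession is a SparkSession in scope.
val converter = new LogLine2Json()              // hypothetical constructor
val jsonLines = logLines.map(converter.toJson)  // hypothetical method: line => JSON string

jsonLines.foreachRDD { rdd =>
  if (!rdd.isEmpty()) {
    // Parse the JSON strings into a DataFrame, then append as ORC
    // so Presto can query the files.
    val df = sqlSession.read.json(rdd)
    df.write.mode("append").orc("s3://bucket/nginx-orc") // path is illustrative
  }
}
```

The `isEmpty` guard avoids writing empty ORC files for batches with no log lines.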
I'm totally new to Spark and I'm trying to learn from the examples. I'm
following this one:
https://github.com/apache/spark/blob/master/external/kinesis-asl/src/main/scala/org/apache/spark/examples/streaming/KinesisWordCountASL.scala.
It works well, but I do have one question. Every time I spl