You can load that binary up as a String RDD, then map over that RDD and
convert each row to your case class representing the data. In the map stage
you could also map the input string into an RDD of JSON values and use the
following function to convert it into a DF
http://spark.apache.org/docs/late
You can call *foreachRDD*(*func*) on the output from the final stage, then
check the time if it's the 15th min of an hour then you flush the output to
DB else you don't.
Let me know if that approach works.
On Tue, Mar 8, 2016 at 2:10 PM, ayan guha wrote:
> Yes if it falls within the batch. But i
config is not being set and it's
falling back to default.
Let me know if that helps.
Saurabh Bajaj
On Tue, Mar 8, 2016 at 6:25 PM, Andy Davidson wrote:
> Hi Ted
>
> I believe by default cassandra listens on 9042
>
> From: Ted Yu
> Date: Tuesday, March 8, 2016 at 6:11 PM
>