Re: What's the simplest way to Read Avro records from Kafka to Spark DataSet/DataFrame?

2017-07-03 Thread kant kodali
import org.apache.avro.Schemaimport org.apache.spark.sql.SparkSession val schema = new Schema.Parser().parse(new File("user.avsc"))val spark = SparkSession.builder().master("local").getOrCreate() spark .read .format("com.databricks.spark.avro") .option("avroSchema", schema.toString) .load("

Re: What's the simplest way to Read Avro records from Kafka to Spark DataSet/DataFrame?

2017-06-29 Thread kant kodali
Forgot to mention I am getting a stream of Avro records and I want to do Structured streaming on these Avro records but first I wan to be able to parse them and put them in a DataSet or something like that. On Thu, Jun 29, 2017 at 12:56 AM, kant kodali wrote: > Hi All, > > What's the simplest wa

What's the simplest way to Read Avro records from Kafka to Spark DataSet/DataFrame?

2017-06-29 Thread kant kodali
Hi All, What's the simplest way to Read Avro records from Kafka and put it into Spark DataSet/DataFrame without using Confluent Schema registry or Twitter Bijection API? Thanks!