GitHub user xuanyuanking opened a pull request: https://github.com/apache/spark/pull/22878
[SPARK-25789][SQL] Support for Dataset of Avro ## What changes were proposed in this pull request? Please credit to @bdrillard cause this mainly based on his previous work. Support for Dataset of Avro records in an API that would allow the user to provide a class to an Encoder for Avro, analogous to the Bean encoder. - Add `ObjectCast` and `InitializeAvroObject` expression - Add an AvroEncoder for Datasets of Avro records to Spark ## How was this patch tested? Add UT in AvroSuite.scala and manual test by modified SQLExample with external avro package. You can merge this pull request into a Git repository by running: $ git pull https://github.com/xuanyuanking/spark SPARK-25789 Alternatively you can review and apply these changes as the patch at: https://github.com/apache/spark/pull/22878.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #22878 ---- commit c70ddb340c58ccd193df60496fe57262e15cf31a Author: Yuanjian Li <xyliyuanjian@...> Date: 2018-10-29T15:50:14Z SPARK-25789: Support for Dataset of Avro ---- --- --------------------------------------------------------------------- To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org