GitHub user gengliangwang opened a pull request:
https://github.com/apache/spark/pull/21762
[SPARK-24800][SQL] Refactor Avro Serializer and Deserializer
## What changes were proposed in this pull request?
Currently, the Avro Deserializer converts input Avro format data to `Row`,
and then converts the `Row` to `InternalRow`.
Likewise, the Avro Serializer converts `InternalRow` to `Row`, and then outputs
Avro format data.
This PR allows direct conversion between `InternalRow` and Avro format data.
Credits to @cloud-fan.
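
To illustrate the difference between the two-step and direct conversion paths described above, here is a toy sketch. It does not use Spark or Avro classes: the dict-based record, the `Schema` alias, and every function name below are hypothetical stand-ins, and the only internal-format detail modeled is that strings are stored as encoded bytes rather than as language-level strings.

```python
from typing import Any, Dict, List, Tuple

# Hypothetical stand-in for a schema: an ordered list of field names.
Schema = List[str]


def to_internal(value: Any) -> Any:
    """Map an external value to its assumed internal form (str -> UTF-8 bytes)."""
    return value.encode("utf-8") if isinstance(value, str) else value


def avro_to_external_row(record: Dict[str, Any], schema: Schema) -> Tuple[Any, ...]:
    """Old path, step 1: build an intermediate external row from the record."""
    return tuple(record[name] for name in schema)


def external_to_internal_row(row: Tuple[Any, ...]) -> List[Any]:
    """Old path, step 2: convert the intermediate row to the internal form."""
    return [to_internal(value) for value in row]


def avro_to_internal_row(record: Dict[str, Any], schema: Schema) -> List[Any]:
    """Direct path: one pass over the record, no intermediate row is allocated."""
    return [to_internal(record[name]) for name in schema]


if __name__ == "__main__":
    record = {"name": "a", "age": 3}
    schema = ["name", "age"]
    two_step = external_to_internal_row(avro_to_external_row(record, schema))
    direct = avro_to_internal_row(record, schema)
    print(two_step == direct)
```

Both paths produce the same internal row in this sketch; the direct path simply skips building the intermediate row object for each record.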
## How was this patch tested?
Unit tests.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/gengliangwang/spark avro_io
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/21762.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #21762
----
commit bb7a43c8f3e34c90ebe8f0e22019c096776b6da3
Author: Gengliang Wang <gengliang.wang@...>
Date: 2018-07-13T08:18:12Z
refactor avro Serializer and Deserializer
----