[
https://issues.apache.org/jira/browse/HUDI-837?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
sivabalan narayanan updated HUDI-837:
-------------------------------------
Labels: bug-bash-0.6.0 pull-request-available (was: pull-request-available)
> Fix AvroKafkaSource to use the latest schema for reading
> --------------------------------------------------------
>
> Key: HUDI-837
> URL: https://issues.apache.org/jira/browse/HUDI-837
> Project: Apache Hudi (incubating)
> Issue Type: Improvement
> Components: DeltaStreamer
> Reporter: Pratyaksh Sharma
> Assignee: Pratyaksh Sharma
> Priority: Major
> Labels: bug-bash-0.6.0, pull-request-available
> Fix For: 0.6.0
>
>
> Currently we specify KafkaAvroDeserializer as the value for
> value.deserializer in AvroKafkaSource. This implies the published record is
> read using the same schema with which it was written even though the schema
> got evolved in between. As a result, messages in incoming batch can have
> different schemas. This has to be handled at the time of actually writing
> records in parquet.
> This Jira aims at providing an option to read all the messages with the same
> schema by implementing a new custom deserializer class.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)