[
https://issues.apache.org/jira/browse/BEAM-10759?focusedWorklogId=490307&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-490307
]
ASF GitHub Bot logged work on BEAM-10759:
-----------------------------------------
Author: ASF GitHub Bot
Created on: 24/Sep/20 16:39
Start Date: 24/Sep/20 16:39
Worklog Time Spent: 10m
Work Description: dennisylyung commented on pull request #12630:
URL: https://github.com/apache/beam/pull/12630#issuecomment-698457493
@iemejia do you have time to review the updates?
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
[email protected]
Issue Time Tracking
-------------------
Worklog Id: (was: 490307)
Time Spent: 1h 10m (was: 1h)
> KafkaIO with Avro deserializer fails with evolved schema
> --------------------------------------------------------
>
> Key: BEAM-10759
> URL: https://issues.apache.org/jira/browse/BEAM-10759
> Project: Beam
> Issue Type: Bug
> Components: io-java-kafka
> Affects Versions: 2.23.0
> Reporter: Dennis Yung
> Assignee: Dennis Yung
> Priority: P2
> Time Spent: 1h 10m
> Remaining Estimate: 0h
>
> When using KafkaIO with ConfluentSchemaRegistryDeserializerProvider,
> exception could be thrown when consuming a topic with evolved schema.
> It is because when the DeserializerProvider is initialized, it create a
> AvroCoder instance using either the latest Avro schema by default, or a
> specific version of provided.
> If the Kafka topic contains records with multiple schema versions, AvroCoder
> will fail to encode records with different schemas. The specific exception
> differs depending on the schema change. For example, I have encountered type
> cast error and null pointer error.
> To fix this issue, we can make use of the writer-reader schema arguments from
> Avro to deserialize Kafka records to the same schema with the AvroCoder. The
> method is available in io.confluent.kafka.serializers.KafkaAvroDeserializer
> {code:java}
> public Object deserialize(String s, byte[] bytes, Schema readerSchema) {
> return this.deserialize(bytes, readerSchema);
> }
> {code}
--
This message was sent by Atlassian Jira
(v8.3.4#803005)