[ 
https://issues.apache.org/jira/browse/HUDI-2320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17406157#comment-17406157
 ] 

ASF GitHub Bot commented on HUDI-2320:
--------------------------------------

pratyakshsharma commented on pull request #3502:
URL: https://github.com/apache/hudi/pull/3502#issuecomment-907596058


   @dongkelun So essentially I wanted to understand the reason behind using 
ByteArraySerDes. Because I have not seen this getting used so far in my working 
experience. Mostly StringSerializer or confluent Avro Serializer are the most 
common ones to use. So just wanted to know if one gets any additional benefit 
with using ByteArraySerializer in terms of data size etc.
   
   LGTM otherwise. :) cc @yanghua 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


> Add support ByteArrayDeserializer in AvroKafkaSource
> ----------------------------------------------------
>
>                 Key: HUDI-2320
>                 URL: https://issues.apache.org/jira/browse/HUDI-2320
>             Project: Apache Hudi
>          Issue Type: Improvement
>          Components: DeltaStreamer
>            Reporter: 董可伦
>            Assignee: 董可伦
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 0.10.0
>
>
> When the 'value.serializer' of Kafka Avro Producer is 
> 'org.apache.kafka.common.serialization.ByteArraySerializer',Use the following 
> configuration
> {code:java}
> --source-class org.apache.hudi.utilities.sources.AvroKafkaSource \
> --schemaprovider-class 
> org.apache.hudi.utilities.schema.JdbcbasedSchemaProvider \
> --hoodie-conf 
> "hoodie.deltastreamer.source.kafka.value.deserializer.class=org.apache.kafka.common.serialization.ByteArrayDeserializer"
> {code}
> For now,It will throw an exception::
> {code:java}
> java.lang.ClassCastException: [B cannot be cast to 
> org.apache.avro.generic.GenericRecord{code}
> After support ByteArrayDeserializer,Use the configuration above,It works 
> properly.And there is no need to provide 'schema.registry.url',For example, 
> we can use the JdbcbasedSchemaProvider to get the sourceSchema



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to