[ 
https://issues.apache.org/jira/browse/BEAM-7029?focusedWorklogId=225662&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-225662
 ]

ASF GitHub Bot logged work on BEAM-7029:
----------------------------------------

                Author: ASF GitHub Bot
            Created on: 10/Apr/19 16:12
            Start Date: 10/Apr/19 16:12
    Worklog Time Spent: 10m 
      Work Description: lukecwik commented on issue #8251: [BEAM-7029] Add 
KafkaIO.Read as external transform
URL: https://github.com/apache/beam/pull/8251#issuecomment-481756803
 
 
   @mxm Supporting the whole KafkaRecord could be a TODO for the future. For 
now you could just produce KV<byte[], byte[]> with the intent that the 
receiving end performs any Kafka key/value -> language specific type decoding 
(instead of passing in the key / value deserializers to the Java SDK). This 
will produce a version that has a lot of value for many people since all "data" 
types can be supported in the Python SDK by exposing a UDF that users can 
implement to perform the decoding in the Python SDK. You could produce some 
defaults for things like bytes/utf8/dates/...
   
   Also, to support the whole KafkaRecord in the future, Kafka must already 
have a well defined "encoding" format for compatibility across all its client 
SDKs so it would be great if we could use that and then use the parsing in the 
Kafka client libraries to do that work. Our coder spec would be 
beam:coder:kafka_record:v1 and just point to a certain version of the 
KafkaRecord spec.
 
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


Issue Time Tracking
-------------------

    Worklog Id:     (was: 225662)
    Time Spent: 7h  (was: 6h 50m)

> Support KafkaIO to be configured externally for use with other SDKs
> -------------------------------------------------------------------
>
>                 Key: BEAM-7029
>                 URL: https://issues.apache.org/jira/browse/BEAM-7029
>             Project: Beam
>          Issue Type: New Feature
>          Components: io-java-kafka, runner-flink, sdk-py-core
>            Reporter: Maximilian Michels
>            Assignee: Maximilian Michels
>            Priority: Major
>          Time Spent: 7h
>  Remaining Estimate: 0h
>
> As of BEAM-6730, we can externally configure existing transforms from SDKs. 
> We should add more useful transforms then just {{GenerateSequence}}. 
> {{KafkaIO}} is a good candidate.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to