[
https://issues.apache.org/jira/browse/BEAM-7029?focusedWorklogId=225352&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-225352
]
ASF GitHub Bot logged work on BEAM-7029:
----------------------------------------
Author: ASF GitHub Bot
Created on: 10/Apr/19 00:13
Start Date: 10/Apr/19 00:13
Worklog Time Spent: 10m
Work Description: chamikaramj commented on issue #8251: [BEAM-7029] Add
KafkaIO.Read as external transform
URL: https://github.com/apache/beam/pull/8251#issuecomment-481485803
Makes sense in general. If a IO connector offers a transform that produces
records as bytes, it makes sense to support pushing deserializers to remote
SDKs.
Looks like there's no such a read transform for Java `KafkaIO` today though.
`KafkaIO.TypedWithoutMetadata` produces a `PCollection<KV<K, V>>` while
`KafkIO.Read` produces a `PCollection<KafkaRecord<K, V>>` where a `KafkaRecord`
wraps` K`, `V` values (and other metadata) extracted from a native Kafka
record. Also, `KafkIO.Read` will not work for cross-language transforms till
we convert it to use SDF (instead of `UnboundedSource`).
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
[email protected]
Issue Time Tracking
-------------------
Worklog Id: (was: 225352)
Time Spent: 5h 20m (was: 5h 10m)
> Support KafkaIO to be configured externally for use with other SDKs
> -------------------------------------------------------------------
>
> Key: BEAM-7029
> URL: https://issues.apache.org/jira/browse/BEAM-7029
> Project: Beam
> Issue Type: New Feature
> Components: io-java-kafka, runner-flink, sdk-py-core
> Reporter: Maximilian Michels
> Assignee: Maximilian Michels
> Priority: Major
> Time Spent: 5h 20m
> Remaining Estimate: 0h
>
> As of BEAM-6730, we can externally configure existing transforms from SDKs.
> We should add more useful transforms then just {{GenerateSequence}}.
> {{KafkaIO}} is a good candidate.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)