[
https://issues.apache.org/jira/browse/BEAM-7029?focusedWorklogId=225543&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-225543
]
ASF GitHub Bot logged work on BEAM-7029:
----------------------------------------
Author: ASF GitHub Bot
Created on: 10/Apr/19 13:12
Start Date: 10/Apr/19 13:12
Worklog Time Spent: 10m
Work Description: mxm commented on issue #8251: [BEAM-7029] Add
KafkaIO.Read as external transform
URL: https://github.com/apache/beam/pull/8251#issuecomment-481684635
@lukecwik That's correct. During implementation I discovered that data is
always wrapped into `KafakRecord` and went with `Read.TypedWithoutMetaData` to
get around this. Like you pointed out, we need either 1) a `KafkaRecord`
standard coder (perhaps something applicable to other messaging systems as well
or 2) `byte[]` where we implement all coding in the SDK. For both we need to
provide Coders in the SDK, but the benefit of having a standard coder would be
a language-independent encoding scheme which is important when dealing with
cross-language pipelines.
> Also, KafkIO.Read will not work for cross-language transforms till we
convert it to use SDF (instead of UnboundedSource).
I think we will have to support Read in portable Runners for now until SDF
is finalized and an adapter for UnboundedSource is available. For the Flink
Runner this is already the case and other Runners can add it easily. It's not
so bad because, after all, it is transparent to the user.
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
[email protected]
Issue Time Tracking
-------------------
Worklog Id: (was: 225543)
Time Spent: 5h 40m (was: 5.5h)
> Support KafkaIO to be configured externally for use with other SDKs
> -------------------------------------------------------------------
>
> Key: BEAM-7029
> URL: https://issues.apache.org/jira/browse/BEAM-7029
> Project: Beam
> Issue Type: New Feature
> Components: io-java-kafka, runner-flink, sdk-py-core
> Reporter: Maximilian Michels
> Assignee: Maximilian Michels
> Priority: Major
> Time Spent: 5h 40m
> Remaining Estimate: 0h
>
> As of BEAM-6730, we can externally configure existing transforms from SDKs.
> We should add more useful transforms then just {{GenerateSequence}}.
> {{KafkaIO}} is a good candidate.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)