[ 
https://issues.apache.org/jira/browse/BEAM-7029?focusedWorklogId=225055&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-225055
 ]

ASF GitHub Bot logged work on BEAM-7029:
----------------------------------------

                Author: ASF GitHub Bot
            Created on: 09/Apr/19 15:37
            Start Date: 09/Apr/19 15:37
    Worklog Time Spent: 10m 
      Work Description: chamikaramj commented on issue #8251: [BEAM-7029] Add 
KafkaIO.Read as external transform
URL: https://github.com/apache/beam/pull/8251#issuecomment-481304996
 
 
   @lukecwik I summarized the current design (adapter from Max's original 
design doc) in an email to dev list: 
https://lists.apache.org/thread.html/b99ba8527422e31ec7bb7ad9dc3a6583551ea392ebdc5527b5fb4a67@%3Cdev.beam.apache.org%3E
   
   The idea is to minimize the amount of work a transform author has to do to 
make a given transform available to remote SDKs. With the current solution, the 
transform authors simply have to provide a builder that can be used to build 
the transform from a given config object (POJO). Framework will generate a 
proto from this that defines the wire format. Remote SDKs can invoke arbitrary 
transforms that are exposed through the expansion API (and has a builder 
available) by using generic ExternalTransform. Also, transform authors may 
define nicer wrappers (of ExternalTransform) if they choose to do so.
   
   Of course this means that a cross-SDK interaction can only be done thorough 
standard coders that are defined for both languages. These will be used for (1) 
encoding parameters used to build the rewmote transform (2) encoding data 
transferred between SDKs at runtime. A pipeline author has to be aware of this 
and has to make sure that transforms at SDK boundaries use standard coders. We 
hope the solution will become more generic once Beam SQL becomes available 
(where we should be able to do a Row to Row conversion from SDK A to SDK B for 
generic types).
 
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


Issue Time Tracking
-------------------

    Worklog Id:     (was: 225055)
    Time Spent: 3.5h  (was: 3h 20m)

> Support KafkaIO to be configured externally for use with other SDKs
> -------------------------------------------------------------------
>
>                 Key: BEAM-7029
>                 URL: https://issues.apache.org/jira/browse/BEAM-7029
>             Project: Beam
>          Issue Type: New Feature
>          Components: io-java-kafka, runner-flink, sdk-py-core
>            Reporter: Maximilian Michels
>            Assignee: Maximilian Michels
>            Priority: Major
>          Time Spent: 3.5h
>  Remaining Estimate: 0h
>
> As of BEAM-6730, we can externally configure existing transforms from SDKs. 
> We should add more useful transforms then just {{GenerateSequence}}. 
> {{KafkaIO}} is a good candidate.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to