[ 
https://issues.apache.org/jira/browse/BEAM-3337?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Thomas Groh updated BEAM-3337:
------------------------------
    Description: 
This represents the entirety of a Pipeline, divided into executable stages 
which are executed in either the runner or within a user container. This 
representation must include the ports over which the SDK harness communicates 
to the runner.

The construction of this graph likely includes most of the nodes present within 
the Runner API graph (PTransform and PCollection), but injects additional nodes 
to represent a remote read or write between harnesses.

Simple fusion (naive producer/consumer and sibling fusion) also should be 
performed here.

This will also require rewriting some boundary coders (e.g. runner -> SDK 
Harness and vice versa) to be agnostic to the language of the runner harness, 
likely by converting into length-prefixed bytes.

  was:
This represents the entirety of a Pipeline, divided into executable stages 
which are executed in either the runner or within a user container. This 
representation must include the ports over which the SDK harness communicates 
to the runner.

The construction of this graph likely includes most of the nodes present within 
the Runner API graph (PTransform and PCollection), but injects additional nodes 
to represent a remote read or write between harnesses.


> Create an Executable Graph representation in the Universal Local Runner
> -----------------------------------------------------------------------
>
>                 Key: BEAM-3337
>                 URL: https://issues.apache.org/jira/browse/BEAM-3337
>             Project: Beam
>          Issue Type: Bug
>          Components: runner-core
>            Reporter: Thomas Groh
>              Labels: portability
>
> This represents the entirety of a Pipeline, divided into executable stages 
> which are executed in either the runner or within a user container. This 
> representation must include the ports over which the SDK harness communicates 
> to the runner.
> The construction of this graph likely includes most of the nodes present 
> within the Runner API graph (PTransform and PCollection), but injects 
> additional nodes to represent a remote read or write between harnesses.
> Simple fusion (naive producer/consumer and sibling fusion) also should be 
> performed here.
> This will also require rewriting some boundary coders (e.g. runner -> SDK 
> Harness and vice versa) to be agnostic to the language of the runner harness, 
> likely by converting into length-prefixed bytes.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to