Henning Rohde created BEAM-4826:
-----------------------------------

             Summary: Flink runner sends bad flatten to SDK
                 Key: BEAM-4826
                 URL: https://issues.apache.org/jira/browse/BEAM-4826
             Project: Beam
          Issue Type: Bug
          Components: runner-flink
            Reporter: Henning Rohde
            Assignee: Aljoscha Krettek


For a Go flatten test w/ 3 input, the Flink runner splits this into 3 bundle 
descriptors. But it sends the original 3-input flatten but w/ 1 actual input 
present in each bundle descriptor. This is inconsistent and the SDK shouldn't 
expect dangling PCollections. In contrast, Dataflow removes the flatten when it 
does the same split.

Snippet:

register: <
  process_bundle_descriptor: <
    id: "3"
    transforms: <
      key: "e4"
      value: <
        unique_name: "github.com/apache/beam/sdks/go/pkg/beam.createFn'1"
        spec: <
          urn: "urn:beam:transform:pardo:v1"
          payload: [...]
        >
        inputs: <
          key: "i0"
          value: "n3"
        >
        outputs: <
          key: "i0"
          value: "n4"
        >
      >
    >
    transforms: <
      key: "e7"
      value: <
        unique_name: "Flatten"
        spec: <
          urn: "beam:transform:flatten:v1"
        >
        inputs: <
          key: "i0"
          value: "n2"
        >
        inputs: <
          key: "i1"
          value: "n4" . // <----------- only one present.
        >
        inputs: <
          key: "i2"
          value: "n6"
        >
        outputs: <
          key: "i0"
          value: "n7"
        >
      >
    >
[...]



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to