David Morávek created BEAM-10164:
------------------------------------

             Summary: Flink: Memory efficient combine implementation for batch 
runner
                 Key: BEAM-10164
                 URL: https://issues.apache.org/jira/browse/BEAM-10164
             Project: Beam
          Issue Type: Bug
          Components: runner-flink
            Reporter: David Morávek
            Assignee: David Morávek


Current Combine implementation assumes that all input values for a single key 
(on both map and reduce side) fit in memory as it needs to sort them by window 
before combining.

We can easily optimize this for non-merging windows by pre-grouping elements by 
(K, W) tuples.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to