Matthias J. Sax created KAFKA-7294:
--------------------------------------

             Summary: Optimize repartitioning for merge()
                 Key: KAFKA-7294
                 URL: https://issues.apache.org/jira/browse/KAFKA-7294
             Project: Kafka
          Issue Type: Improvement
          Components: streams
            Reporter: Matthias J. Sax


For a merge() operator we check at compile time, if one of the input KStreams 
requires repartitioning and set the "requiresRepartitioning" flag for the 
output  KStream for this case. This implies, that we pipe data from all input 
KStreams through the repartition topic after the merge().

Using our optimizer, we could push down the repartition operation before the 
merge() to only repartition the KStream(s) that required repartition and thus 
save network IO for all KStreams that don't require repartitioning.

Note, that this operation is only correct, if all input streams are 
co-partitioned (cf. KAFKA-7293).



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to