Matthias J. Sax created KAFKA-4718:
--------------------------------------

             Summary: Revisit DSL partitioning assumption for KStream source topics
                 Key: KAFKA-4718
                 URL: https://issues.apache.org/jira/browse/KAFKA-4718
             Project: Kafka
          Issue Type: Improvement
          Components: streams
            Reporter: Matthias J. Sax
            Priority: Minor


Currently, when reading one or multiple topics via a single call to 
{{KStreamBuilder#stream()}}, it is assumed that the data is correctly 
partitioned by key.

For a "single topic" {{KStream}}, this is a fair assumption. For a 
multi-topic {{KStream}}, however, the assumption is most likely not true if the 
input topics have different numbers of partitions, because producers use hash 
partitioning by default, so the same key can map to different partition numbers 
in different topics. Thus, to get correct partitioning, all producers for those 
input topics need to use the same (or at least a compatible) custom partitioner.
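
To illustrate the problem, here is a minimal sketch of hash partitioning applied to two topics with different partition counts. It is not the producer's actual partitioner: {{PartitionSketch}}, {{partitionFor}}, the key {{k1}}, and the topic sizes are all made up for illustration, and {{Arrays.hashCode}} stands in for the hash function the real producer uses.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

class PartitionSketch {

    // Stand-in for default hash partitioning: hash the serialized key and
    // take it modulo the partition count. The real producer uses a different
    // hash function; Arrays.hashCode is an assumption made here only to keep
    // the sketch self-contained.
    static int partitionFor(String key, int numPartitions) {
        byte[] keyBytes = key.getBytes(StandardCharsets.UTF_8);
        int positiveHash = Arrays.hashCode(keyBytes) & 0x7fffffff; // clear the sign bit
        return positiveHash % numPartitions;
    }

    public static void main(String[] args) {
        String key = "k1";
        int partitionsTopicA = 4; // hypothetical partition counts
        int partitionsTopicB = 6;
        // Same key, same hash, but a different modulus per topic.
        System.out.println("topic-a partition: " + partitionFor(key, partitionsTopicA));
        System.out.println("topic-b partition: " + partitionFor(key, partitionsTopicB));
    }
}
```

In this sketch the key lands in a different partition number in each topic, so a task that reads partition N of both topics would not see all records for that key. When both topics have the same partition count, the two results always agree.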

Making this the default assumption seems rather risky, and we might want to 
revisit it, or at least update the docs with corresponding hints.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
