Robert, Flink version 1.12.2. Flink connector Kafka Version 2..12
The partitions are assigned equally if we are reading from a single topic. Our Use case is to read from multiple topics [topics r4 regex pattern] we use 6 topics and 1 partition per topic for this job. In this case , few of the kafka consumer tasks are not allocated. Thanks, Prasanna. On Tue, 20 Jul 2021, 17:44 Robert Metzger, <rmetz...@apache.org> wrote: > Hi Prasanna, > which Flink version and Kafka connector are you using? (the "KafkaSource" > or "FlinkKafkaConsumer"?) > > The partition assignment for the FlinkKafkaConsumer is defined here: > https://github.com/apache/flink/blob/master/flink-connectors/flink-connector-kafka/src/main/java/org/apache/flink/streaming/connectors/kafka/internals/KafkaTopicPartitionAssigner.java#L27-L43 > > > I assume all your topics have one partition only. Still, the "startIndex" > should be determined based on the hash of the topic name. My only > explanation is that your unlucky with the distribution of the hashes. > If this leads to performance issues, consider using topics with multiple > partitions, change the name of the topics or increase the parallelism of > your consumer. > > > > > On Tue, Jul 20, 2021 at 7:53 AM Prasanna kumar < > prasannakumarram...@gmail.com> wrote: > >> Hi, >> >> We have a Flink job reading from multiple Kafka topics based on a regex >> pattern. >> >> What we have found out is that the topics are not shared between the >> kafka consumers in an even manner . >> >> Example if there are 8 topics and 4 kafka consumer operators . 1 >> consumer is assigned 6 topics , 2 consumers assigned 1 each and the last >> consumer is not assigned at all. >> >> This leads to inadequate usage of the resources. >> >> I could not find any setting/configuration which would make them as even >> as possible. >> >> Let me know if there's a way to do the same. >> >> Thanks, >> Prasanna. >> >