[ 
https://issues.apache.org/jira/browse/FLINK-22840?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Flink Jira Bot updated FLINK-22840:
-----------------------------------
    Labels: performance stale-minor  (was: performance)

I am the [Flink Jira Bot|https://github.com/apache/flink-jira-bot/] and I help 
the community manage its development. I see this issues has been marked as 
Minor but is unassigned and neither itself nor its Sub-Tasks have been updated 
for 180 days. I have gone ahead and marked it "stale-minor". If this ticket is 
still Minor, please either assign yourself or give an update. Afterwards, 
please remove the label or in 7 days the issue will be deprioritized.


> Assign-evenly kafkaTopicPartitions of multiple topics to flinkKafkaConsumer 
> subtask 
> ------------------------------------------------------------------------------------
>
>                 Key: FLINK-22840
>                 URL: https://issues.apache.org/jira/browse/FLINK-22840
>             Project: Flink
>          Issue Type: Improvement
>          Components: Connectors / Kafka
>    Affects Versions: 1.11.0
>            Reporter: Xu xiaolong
>            Priority: Minor
>              Labels: performance, stale-minor
>   Original Estimate: 12h
>  Remaining Estimate: 12h
>
> Now ,with flink1.11 kafka connecotr,when we consume multiple kafka topics by 
> one flinkkafkaconsumer , we want to set the consumer parallelism equals with 
> the total partitions count of multiple topic , with a decision that each 
> topic partition consume by one kafka consumer, so each topic partition count 
> is less than the subtask count. But,currently there is one problem , some 
> subtask is total free while someothers workload is very high, this cause is 
> the partitionAssigner assign partion of earch topic indepently.
> Following is one example: Target topics: topi1, topic2 ,topic3 ,topic4.  each 
> has 3 partitions. In our job we consume the 4 topic by one consumer , our 
> flink standalone cluster got 9 taskworkers on different nodes. we want 
> balance the workload as much as possible, so we  set the paralelism of 
> flinkkafkaconsumer to 12. from the UI we notice that the 0-5 subtask is free 
> without partition assigned, the total 12 partiton is assigned to 6-11 
> subtask. We learned the source code of KafkaTopicPartitionAssigner to explain 
> this phenomenon ,and then we extend one more partition assign strategy which 
> can deal with the need we describe up, this stategy can evenly assign 
> partiton from multiple topic grobally to subtask of consumer. we want to 
> contibute to flink, so someone has the same requirement can use it directlly.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

Reply via email to