Genmao Yu created KAFKA-5928:
--------------------------------
Summary: Avoid redundant requests to zookeeper when reassign topic
partition
Key: KAFKA-5928
URL: https://issues.apache.org/jira/browse/KAFKA-5928
Project: Kafka
Issue Type: Improvement
Components: admin
Affects Versions: 0.11.0.0, 0.10.2.1
Reporter: Genmao Yu
We mistakenly request topic level information according to partitions config in
the assignment json file. For example
https://github.com/apache/kafka/blob/trunk/core/src/main/scala/kafka/admin/ReassignPartitionsCommand.scala#L550:
{code}
val validPartitions = proposedPartitionAssignment.filter { case (p, _) =>
validatePartition(zkUtils, p.topic, p.partition) }
{code}
If reassign 1000 partitions (in 10 topics), we need to request zookeeper 1000
times here. But actually we only need to request just 10 (topics) times. We
test a large-scale assignment, about 10K partitions. It takes tens of minutes.
After optimization, it will reduce to less than 1minute.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)