Dong Lin created KAFKA-5335:
-------------------------------
Summary: Controller should batch updatePartitionReassignmentData()
operation
Key: KAFKA-5335
URL: https://issues.apache.org/jira/browse/KAFKA-5335
Project: Kafka
Issue Type: Bug
Reporter: Dong Lin
Assignee: Dong Lin
Currently controller will update partition reassignment data every time a
partition in the reassignment is completed. It means that if user specifies a
huge reassignment znode of size 1 MB to move 10K partitions, controller will
need to write roughly 0.5 MB * 10000 = 5 GB data to zookeeper in order to
complete this reassignment. This is because controller needs to write the
remaining partitions to the znode every time a partition is completely moved.
This is problematic because such a huge reassignment may greatly slow down
Kafka controller. Note that partition reassignment doesn't necessarily cause
data movement between brokers because we may use it only to recorder the
replica list of partitions to evenly distribute preferred leader.
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)