[ 
https://issues.apache.org/jira/browse/KAFKA-2406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14658685#comment-14658685
 ] 

Jun Rao commented on KAFKA-2406:
--------------------------------

[~becket_qin], another way to reduce the overhead is to batch on the broker 
side. Every time a broker changes an ISR, it just saves the partition in an 
in-memory map. Periodically, the replica manager can collect all partitions in 
the map and write a single node in the ISR change path. This should reduce the 
number of UpdateMetadataRequests the controller sends to the brokers. It also 
reduces the number of ZK nodes to be deleted and the number of times the ISR 
change watchers are triggered.

Also, could you provide a bit more details on the performance issue? Does that 
issue happen when shutting down the first broker in the cluster or when there 
is another broker just being restarted? It will also be good know where the 
bottleneck is: whether just in the number of UpdateMetadataRequests or in 
processing the watchers as well.

> ISR propagation should be throttled to avoid overwhelming controller.
> ---------------------------------------------------------------------
>
>                 Key: KAFKA-2406
>                 URL: https://issues.apache.org/jira/browse/KAFKA-2406
>             Project: Kafka
>          Issue Type: Bug
>            Reporter: Jiangjie Qin
>            Assignee: Jiangjie Qin
>            Priority: Blocker
>
> This is a follow up patch for KAFKA-1367.
> We need to throttle the ISR propagation rate to avoid flooding in controller 
> to broker traffic. This might significantly increase time of controlled 
> shutdown or cluster startup.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to