Guozhang Wang created KAFKA-3559:
------------------------------------

             Summary: Task creation time taking too long in rebalance callback
                 Key: KAFKA-3559
                 URL: https://issues.apache.org/jira/browse/KAFKA-3559
             Project: Kafka
          Issue Type: Bug
          Components: streams
            Reporter: Guozhang Wang
             Fix For: 0.10.0.0


Currently in Kafka Streams, we create stream tasks upon getting newly assigned 
partitions in rebalance callback function {code} onPartitionAssigned {code}, 
which involves initialization of the processor state stores as well (including 
opening the rocksDB, restore the store from changelog, etc, which takes time).

With a large number of state stores, the initialization time itself could take 
tens of seconds, which usually is larger than the consumer session timeout. As 
a result, when the callback is completed, the consumer is already treated as 
failed by the coordinator and rebalance again.

We need to consider if we can optimize the initialization process, or move it 
out of the callback function, and while initializing the stores one-by-one, use 
poll call to send heartbeats to avoid being kicked out by coordinator.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to