Indefinite growth of FetchRequestPurgatory

András Serény Mon, 03 Nov 2014 05:44:06 -0800

Hi Kafka users,

we're running a cluster of two Kafka 0.8.1.1 brokers, with twofold replication of each topic.

When both brokers are up, after a short while the FetchRequestPurgatory starts to grow indefinitely on the leader (detectable via a heap dump and also via the "FetchRequestPurgatory"."PurgatorySize" JMX metric), eventually leading to an OOM error. When one of the brokers is shut down, the purgatory stops growing in size, and the remaining broker runs fine. In https://issues.apache.org/jira/browse/KAFKA-1016, I see this can occur when a fetcher specifies too large a max wait time, but we don't override replica.fetch.wait.max.ms, leaving it at the default 500 ms.


Do you have any suggestions as to what the cause might be and how to fix it?

Thanks a lot,
András
