Patrik Kleindl created KAFKA-19341:
--------------------------------------
Summary: Execution of HighWatermarkUpdate failed
Key: KAFKA-19341
URL: https://issues.apache.org/jira/browse/KAFKA-19341
Project: Kafka
Issue Type: Bug
Components: group-coordinator
Affects Versions: 4.0.0
Reporter: Patrik Kleindl
We got the following Exception multiple times in our logs when a client showed
problems with the group coordinator:
{code:java}
[ERROR] 2025-05-27 02:18:51,623 [group-coordinator-event-processor-0]
org.apache.kafka.coordinator.group.runtime.CoordinatorRuntime complete -
[GroupCoordinator id=2] Execution of HighWatermarkUpdate failed due to value
45050145 outside of histogram covered range. Caused by:
java.lang.ArrayIndexOutOfBoundsException: Index 16734 out of bounds for length
7168.
java.lang.ArrayIndexOutOfBoundsException: value 45050145 outside of histogram
covered range. Caused by: java.lang.ArrayIndexOutOfBoundsException: Index 16734
out of bounds for length 7168
at
org.HdrHistogram.AbstractHistogram.handleRecordException(AbstractHistogram.java:571)
at
org.HdrHistogram.AbstractHistogram.recordSingleValue(AbstractHistogram.java:563)
at
org.HdrHistogram.AbstractHistogram.recordValue(AbstractHistogram.java:467)
at org.HdrHistogram.Recorder.recordValue(Recorder.java:136)
at
org.apache.kafka.coordinator.group.metrics.HdrHistogram.record(HdrHistogram.java:98)
at
org.apache.kafka.coordinator.group.metrics.KafkaMetricHistogram.record(KafkaMetricHistogram.java:128)
at org.apache.kafka.common.metrics.Sensor.recordInternal(Sensor.java:237)
at org.apache.kafka.common.metrics.Sensor.record(Sensor.java:198)
at
org.apache.kafka.coordinator.group.metrics.GroupCoordinatorRuntimeMetrics.recordEventPurgatoryTime(GroupCoordinatorRuntimeMetrics.java:301)
at
org.apache.kafka.coordinator.group.runtime.CoordinatorRuntime$CoordinatorWriteEvent.complete(CoordinatorRuntime.java:1362)
at
org.apache.kafka.deferred.DeferredEventQueue.completeUpTo(DeferredEventQueue.java:63)
at
org.apache.kafka.coordinator.group.runtime.CoordinatorRuntime$HighWatermarkListener.lambda$onHighWatermarkUpdated$0(CoordinatorRuntime.java:1802)
at
org.apache.kafka.coordinator.group.runtime.CoordinatorRuntime$CoordinatorInternalEvent.run(CoordinatorRuntime.java:1723)
at
org.apache.kafka.coordinator.group.runtime.MultiThreadedEventProcessor$EventProcessorThread.handleEvents(MultiThreadedEventProcessor.java:148)
at
org.apache.kafka.coordinator.group.runtime.MultiThreadedEventProcessor$EventProcessorThread.run(MultiThreadedEventProcessor.java:180){code}
We are running Confluent Platform 7.9 which should be based on Apache Kafka
3.9, but this Exception should only be present in Kafka 4.0.
I will create a ticket with Confluent, but as this code is part of Apache Kafka
itself it could probably affect others too.
If I understand the exception the HighWatermarkUpdate operation itself was
successful but the problem is caused by writing the metrics.
After a restart of the cluster and the client the problem was resolved, but it
didn't show up right after the last update or changes.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)