Lixin Yao created KAFKA-15214:
---------------------------------

             Summary: Add metrics for OffsetOutOfRangeException when tiered 
storage is enabled
                 Key: KAFKA-15214
                 URL: https://issues.apache.org/jira/browse/KAFKA-15214
             Project: Kafka
          Issue Type: Improvement
          Components: metrics
    Affects Versions: 3.6.0
            Reporter: Lixin Yao
             Fix For: 3.6.0


In the current metrics RemoteReadErrorsPerSec, the exception type 
OffsetOutOfRangeException is not included.


In our testing with tiered storage feature, we noticed several cases where 
remote download is affected and stuck due to repeatedly 
OffsetOutOfRangeException in some particular broker or topic partitions. The 
root cause could be various but currently without a metrics it's very hard to 
catch this issue and debug in a timely fashion. It's understandable that the 
exception itself could not be the root cause but this exception metric could be 
a good metrics for us to alert and investigate.

Related discussion
[https://github.com/apache/kafka/pull/13944#discussion_r1266243006]

I am happy to contribute to this if the request is agreed.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to