[ 
https://issues.apache.org/jira/browse/KAFKA-15214?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17744334#comment-17744334
 ] 

Divij Vaidya commented on KAFKA-15214:
--------------------------------------

Hi Lixin,
Can you help us understand more about the motivation (perhaps by describing an
example scenario where this metric would be useful)? Also, you may want to
consider adding the specific exception type as a "tag" on the existing error
metric.
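
A minimal sketch of the tagging idea, assuming a hypothetical in-memory counter
rather than Kafka's actual metrics classes: the name TaggedErrorCounter and its
methods are made up for illustration and do not appear in the Kafka code base.
It keeps one count per exception type, so a single error metric can still be
broken down by type:

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.atomic.LongAdder;

// Hypothetical illustration only; this is not how Kafka registers its metrics.
class TaggedErrorCounter {
    // One counter per exception type; the simple class name acts as the "tag".
    private final Map<String, LongAdder> countsByType = new ConcurrentHashMap<>();

    // Record one error under the tag derived from its exception class.
    void record(Throwable error) {
        String tag = error.getClass().getSimpleName();
        countsByType.computeIfAbsent(tag, k -> new LongAdder()).increment();
    }

    // Number of errors seen for a single exception type (0 if never seen).
    long count(String exceptionType) {
        LongAdder adder = countsByType.get(exceptionType);
        return adder == null ? 0L : adder.sum();
    }

    // Total across all exception types, i.e. the untagged error count.
    long total() {
        return countsByType.values().stream().mapToLong(LongAdder::sum).sum();
    }
}
```

With this shape, an alert could watch count("OffsetOutOfRangeException") on its
own while total() keeps the meaning of the existing aggregate error count.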

> Add metrics for OffsetOutOfRangeException when tiered storage is enabled
> ------------------------------------------------------------------------
>
>                 Key: KAFKA-15214
>                 URL: https://issues.apache.org/jira/browse/KAFKA-15214
>             Project: Kafka
>          Issue Type: Improvement
>          Components: metrics
>    Affects Versions: 3.6.0
>            Reporter: Lixin Yao
>            Priority: Minor
>              Labels: KIP-405
>             Fix For: 3.6.0
>
>
> The current RemoteReadErrorsPerSec metric does not include the exception
> type OffsetOutOfRangeException.
> In our testing of the tiered storage feature (at Apple), we noticed several
> cases where remote download was affected and got stuck due to repeated
> OffsetOutOfRangeException errors on particular brokers or topic partitions.
> The root cause can vary, but without a metric it is currently very hard to
> catch this issue and debug it in a timely fashion. We understand that the
> exception itself may not be the root cause, but a metric for it would give
> us something to alert on and investigate.
> Related discussion:
> [https://github.com/apache/kafka/pull/13944#discussion_r1266243006]
> I am happy to contribute to this if the request is agreed to.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
