[
https://issues.apache.org/jira/browse/KAFKA-15214?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Colin McCabe updated KAFKA-15214:
---------------------------------
Fix Version/s: 4.0.0
(was: 3.9.0)
> Add metrics for OffsetOutOfRangeException when tiered storage is enabled
> ------------------------------------------------------------------------
>
> Key: KAFKA-15214
> URL: https://issues.apache.org/jira/browse/KAFKA-15214
> Project: Kafka
> Issue Type: Task
> Components: metrics
> Affects Versions: 3.6.0
> Reporter: Lixin Yao
> Priority: Minor
> Labels: KIP-405
> Fix For: 4.0.0
>
>
> In the current metrics RemoteReadErrorsPerSec, the exception type
> OffsetOutOfRangeException is not included.
> In our testing with tiered storage feature (at Apple), we noticed several
> cases where remote download is affected and stuck due to repeatedly
> OffsetOutOfRangeException in some particular broker or topic partitions. The
> root cause could be various but currently without a metrics it's very hard to
> catch this issue and debug in a timely fashion. It's understandable that the
> exception itself could not be the root cause but this exception metric could
> be a good metrics for us to alert and investigate.
> Related discussion
> [https://github.com/apache/kafka/pull/13944#discussion_r1266243006]
> I am happy to contribute to this if the request is agreed.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)