Hi everyone,

I would like to start the discussion for *KIP-1267*: Tiered Storage Cost
Attribution Metrics.

*KIP:*
https://cwiki.apache.org/confluence/display/KAFKA/KIP-1267%3A+Tiered+Storage+Cost+Attribution+Metrics


*Jira:*https://issues.apache.org/jira/browse/KAFKA-20047

Tiered Storage (KIP-405) has been a great addition for managing storage
costs, but it introduces variable costs for data access (GET requests and
egress). Currently, existing metrics only track remote fetches at the topic
level.

In a multi-tenant cluster, if a specific consumer group triggers a massive
amount of historical reads from S3, there is no easy way to identify them
using current metrics.

This KIP proposes adding client-id tags to the remote fetch metrics in the
RemoteLogManager. This enables operators to attribute remote fetch costs to
specific consumers for chargeback and governance purposes.

Looking forward to your feedback.

Regards,

Viquar khan

https://www.linkedin.com/in/vaquar-khan-b695577/

Reply via email to