Hi everyone, I would like to start the discussion for *KIP-1267*: Tiered Storage Cost Attribution Metrics.
*KIP:* https://cwiki.apache.org/confluence/display/KAFKA/KIP-1267%3A+Tiered+Storage+Cost+Attribution+Metrics
*Jira:* https://issues.apache.org/jira/browse/KAFKA-20047

Tiered Storage (KIP-405) has been a great addition for managing storage costs, but it introduces variable costs for data access (GET requests and egress). Existing metrics track remote fetches only at the topic level. In a multi-tenant cluster, if a specific consumer group triggers a massive amount of historical reads from S3, there is no easy way to identify it from the current metrics.

This KIP proposes adding client-id tags to the remote fetch metrics in the RemoteLogManager. This would enable operators to attribute remote fetch costs to specific consumers for chargeback and governance purposes.

Looking forward to your feedback.

Regards,
Viquar khan
https://www.linkedin.com/in/vaquar-khan-b695577/
