fujian created KAFKA-19893:
------------------------------
Summary: Reduce redundant storage in remote tier (configurable)
Key: KAFKA-19893
URL: https://issues.apache.org/jira/browse/KAFKA-19893
Project: Kafka
Issue Type: Improvement
Components: Tiered-Storage
Reporter: fujian
Assignee: fujian
Attachments: image-2025-11-18-19-11-49-498.png
Currently, Kafka uploads all non-active local log segments to remote storage
even when they are still within the local retention period, resulting in
redundant storage of the same data in both tiers. This wastes storage capacity
and network bandwidth without providing immediate benefits, since reads during
the retention window prioritize local data.
However, some users rely on remote storage for real-time analytics and need the
latest data to be available as soon as possible. Therefore, this optimization
is offered as an optional configuration rather than the default behavior.
!image-2025-11-18-19-12-01-306.png!
--
This message was sent by Atlassian Jira
(v8.20.10#820010)