[ 
https://issues.apache.org/jira/browse/KAFKA-19893?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

fujian updated KAFKA-19893:
---------------------------
    Description: 
Currently, Kafka uploads all non-active local log segments to remote storage 
even when they are still within the local retention period, resulting in 
redundant storage of the same data in both tiers. This wastes storage capacity 
and network bandwidth without providing immediate benefits, since reads during 
the retention window prioritize local data.

However, some users rely on remote storage for real-time analytics and need the 
latest data to be available as soon as possible. Therefore, this optimization 
is offered as an optional configuration rather than the default behavior.

!image-2025-11-18-19-19-27-579.png!

 

  was:
Currently, Kafka uploads all non-active local log segments to remote storage 
even when they are still within the local retention period, resulting in 
redundant storage of the same data in both tiers. This wastes storage capacity 
and network bandwidth without providing immediate benefits, since reads during 
the retention window prioritize local data.

However, some users rely on remote storage for real-time analytics and need the 
latest data to be available as soon as possible. Therefore, this optimization 
is offered as an optional configuration rather than the default behavior.

 


> Reduce redundant storage in remote tier (configurable)
> ------------------------------------------------------
>
>                 Key: KAFKA-19893
>                 URL: https://issues.apache.org/jira/browse/KAFKA-19893
>             Project: Kafka
>          Issue Type: Improvement
>          Components: Tiered-Storage
>            Reporter: fujian
>            Assignee: fujian
>            Priority: Major
>         Attachments: image-2025-11-18-19-19-27-579.png
>
>
> Currently, Kafka uploads all non-active local log segments to remote storage 
> even when they are still within the local retention period, resulting in 
> redundant storage of the same data in both tiers. This wastes storage 
> capacity and network bandwidth without providing immediate benefits, since 
> reads during the retention window prioritize local data.
> However, some users rely on remote storage for real-time analytics and need 
> the latest data to be available as soon as possible. Therefore, this 
> optimization is offered as an optional configuration rather than the default 
> behavior.
> !image-2025-11-18-19-19-27-579.png!
>  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to