[ 
https://issues.apache.org/jira/browse/KAFKA-12520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Dhruvil Shah reassigned KAFKA-12520:
------------------------------------

    Assignee: Dhruvil Shah

> Producer state is needlessly rebuilt on startup
> -----------------------------------------------
>
>                 Key: KAFKA-12520
>                 URL: https://issues.apache.org/jira/browse/KAFKA-12520
>             Project: Kafka
>          Issue Type: Bug
>            Reporter: Dhruvil Shah
>            Assignee: Dhruvil Shah
>            Priority: Major
>
> When we find a {{.swap}} file on startup, we typically want to rename and 
> replace it as {{.log}}, {{.index}}, {{.timeindex}}, etc. as a way to complete 
> any ongoing replace operations. These swap files are usually known to have 
> been flushed to disk before the replace operation begins.
> One flaw in the current logic is that when we recover these swap files on 
> startup, we end up truncating the producer state and rebuild it from scratch. 
> This is unneeded as the replace operation does not mutate the producer state 
> by itself. It is only meant to replace the {{.log}} file along with 
> corresponding indices.
> Because of this unneeded producer state rebuild operation, we have seen 
> multi-hour startup times for clusters that have large compacted topics.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to