[
https://issues.apache.org/jira/browse/KAFKA-12634?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17317393#comment-17317393
]
A. Sophie Blee-Goldman commented on KAFKA-12634:
------------------------------------------------
Nice catch. Do you know which versions this affects?
> Should checkpoint after restore finished
> ----------------------------------------
>
> Key: KAFKA-12634
> URL: https://issues.apache.org/jira/browse/KAFKA-12634
> Project: Kafka
> Issue Type: Improvement
> Components: streams
> Reporter: Matthias J. Sax
> Priority: Major
>
> For state stores, Kafka Streams maintains local checkpoint files to track the
> offsets of the state store changelog topics. The checkpoint is updated on
> commit or when a task is closed cleanly.
> However, after a successful restore, the checkpoint is not written. Thus, if
> an instance crashes after restore but before committing, the checkpoint file
> is missing even though the state is on local disk (indicating that there is
> no state), and the state would be restored from scratch.
> While in most cases the time between restore end and next commit is small,
> there are cases when this time could be large, for example if there is no new
> input data to be processed (if there is no input data, the commit would be
> skipped).
> Thus, we should write the checkpoint file after a successful restore to close
> this gap (of course, only for at-least-once processing).
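
The gap described in the issue can be illustrated with a small toy model. This is only a sketch under stated assumptions, not Kafka Streams code: the file name `.checkpoint`, the JSON format, and the functions `read_checkpoint`, `write_checkpoint`, `restore`, and `simulate_crash_after_restore` are all hypothetical and chosen for illustration. It models a single changelog partition and the at-least-once case only.

```python
import json
import os
import tempfile

CHECKPOINT_FILE = ".checkpoint"  # hypothetical file name for this sketch


def read_checkpoint(state_dir):
    """Return the checkpointed changelog offset, or None if no checkpoint exists."""
    path = os.path.join(state_dir, CHECKPOINT_FILE)
    if not os.path.exists(path):
        return None
    with open(path) as f:
        return json.load(f)["offset"]


def write_checkpoint(state_dir, offset):
    """Write the checkpoint atomically: temp file first, then rename."""
    path = os.path.join(state_dir, CHECKPOINT_FILE)
    tmp = path + ".tmp"
    with open(tmp, "w") as f:
        json.dump({"offset": offset}, f)
        f.flush()
        os.fsync(f.fileno())
    os.replace(tmp, path)


def restore(state_dir, changelog_end_offset, checkpoint_after_restore):
    """Replay the changelog from the checkpoint (or from 0) up to the end offset.

    Returns the number of changelog records that had to be replayed.
    """
    start = read_checkpoint(state_dir)
    if start is None:
        start = 0  # no checkpoint: local state is treated as absent
    replayed = changelog_end_offset - start
    if checkpoint_after_restore:
        # The change proposed in the issue: checkpoint once restore finishes.
        write_checkpoint(state_dir, changelog_end_offset)
    return replayed


def simulate_crash_after_restore(checkpoint_after_restore, end_offset=1000):
    """Restore, 'crash' before any commit, restart, and restore again."""
    with tempfile.TemporaryDirectory() as state_dir:
        first = restore(state_dir, end_offset, checkpoint_after_restore)
        # Crash here: no commit happened, so no commit-time checkpoint exists.
        second = restore(state_dir, end_offset, checkpoint_after_restore)
        return first, second
```

In this model, `simulate_crash_after_restore(False)` replays the full changelog on both starts, while `simulate_crash_after_restore(True)` replays nothing on the second start because the checkpoint already points at the changelog end.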