[ 
https://issues.apache.org/jira/browse/FLINK-28605?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated FLINK-28605:
-----------------------------------
    Labels: pull-request-available  (was: )

> Throw exception intentionally when new snapshots are committed during restore
> -----------------------------------------------------------------------------
>
>                 Key: FLINK-28605
>                 URL: https://issues.apache.org/jira/browse/FLINK-28605
>             Project: Flink
>          Issue Type: Improvement
>          Components: Table Store
>    Affects Versions: table-store-0.2.0
>            Reporter: Caizhi Weng
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: table-store-0.2.0
>
>
> Currently, snapshots are committed in {{notifyCheckpointComplete}}. If the job
> fails between a successful checkpoint and the call to
> {{notifyCheckpointComplete}}, these snapshots will be committed after the job
> restarts.
> However, when the writers start, they also need to read from the latest
> snapshot (to build the latest structure of the LSM tree). These two steps may
> happen concurrently, so what the writers see may not be the latest snapshot.
> To fix this problem, we can intentionally throw an exception after new
> snapshots are committed during restore. The job is then forced to restart.
> On the next attempt those snapshots are already committed, so it is very
> likely that the writers will see the latest snapshot.
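
The proposal in the description can be sketched roughly as follows. This is a minimal illustration in plain Java, not the Table Store implementation: RestoreCommitSketch, SnapshotStore, recoverPending and restore are hypothetical names, and checkpoint IDs stand in for the real committables.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class RestoreCommitSketch {

    /** Hypothetical stand-in for the table's snapshot storage. */
    static class SnapshotStore {
        private final List<Long> committed = new ArrayList<>();

        boolean isCommitted(long checkpointId) {
            return committed.contains(checkpointId);
        }

        void commit(long checkpointId) {
            committed.add(checkpointId);
        }

        /** Returns the latest committed ID, or -1 if nothing is committed. */
        long latest() {
            return committed.isEmpty() ? -1L : committed.get(committed.size() - 1);
        }
    }

    /** Commits every pending checkpoint that is not committed yet; returns how many were new. */
    static int recoverPending(SnapshotStore store, List<Long> pending) {
        int newlyCommitted = 0;
        for (long id : pending) {
            if (!store.isCommitted(id)) {
                store.commit(id);
                newlyCommitted++;
            }
        }
        return newlyCommitted;
    }

    /**
     * Restore step (assumed shape): if any snapshot had to be committed here,
     * fail on purpose so that the restarted writers read the latest snapshot.
     */
    static void restore(SnapshotStore store, List<Long> pending) {
        int newlyCommitted = recoverPending(store, pending);
        if (newlyCommitted > 0) {
            throw new IllegalStateException(
                    newlyCommitted + " snapshot(s) committed during restore; failing intentionally");
        }
    }

    public static void main(String[] args) {
        SnapshotStore store = new SnapshotStore();
        List<Long> pending = Arrays.asList(1L, 2L);
        try {
            restore(store, pending); // first attempt commits and fails on purpose
        } catch (IllegalStateException e) {
            System.out.println("first attempt: " + e.getMessage());
        }
        restore(store, pending); // second attempt finds nothing new to commit
        System.out.println("second attempt succeeded, latest snapshot = " + store.latest());
    }
}
```

In this sketch the second attempt finds nothing left to commit, so it does not throw, and writers started on that attempt would read a snapshot store that no longer changes underneath them.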



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
