tsreaper opened a new pull request, #225:
URL: https://github.com/apache/flink-table-store/pull/225

   Currently, snapshots are committed in `notifyCheckpointComplete`. If the job 
fails after a checkpoint succeeds but before `notifyCheckpointComplete` is 
called, the corresponding snapshots are committed only after the job 
restarts.
   
   However, when the writers start, they also need to read the latest 
snapshot (to build the current structure of the LSM tree). The commit on 
restore and this read may run concurrently, so the snapshot the writers see 
may not be the latest one.
   
   To fix this problem, we can intentionally throw an exception after new 
snapshots are committed during restore. The job is then forced to restart, 
and after that restart the writers are very likely to see the latest snapshot.
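
   The idea can be sketched roughly as follows. This is a minimal, 
self-contained illustration and not the code in this pull request: 
`RestoreCommitSketch`, `SnapshotStore` and `commitOnRestore` are hypothetical 
names, and the check that skips already-committed checkpoints is an 
assumption about how a second restore would avoid failing again.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical sketch; these are not flink-table-store classes. */
class RestoreCommitSketch {

    /** Stand-in for the snapshot storage: remembers committed checkpoint ids. */
    static final class SnapshotStore {
        private final List<Long> committed = new ArrayList<>();

        boolean isCommitted(long checkpointId) {
            return committed.contains(checkpointId);
        }

        void commit(long checkpointId) {
            committed.add(checkpointId);
        }
    }

    /**
     * Commits every restored checkpoint that is not yet in the store.
     * If at least one new snapshot was committed, throws on purpose so that
     * the job restarts and the writers re-read the snapshot list.
     */
    static void commitOnRestore(SnapshotStore store, List<Long> restoredCheckpointIds) {
        boolean committedAny = false;
        for (long id : restoredCheckpointIds) {
            if (!store.isCommitted(id)) { // assumed idempotency check
                store.commit(id);
                committedAny = true;
            }
        }
        if (committedAny) {
            throw new IllegalStateException(
                    "Committed new snapshots during restore; restarting the job on purpose.");
        }
    }
}
```

   In this sketch the first restore commits the pending snapshots and fails 
the job; on the next restore nothing is left to commit, so no exception is 
thrown and the writers start against the already-committed snapshots.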


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]