dawidwys opened a new pull request #18539:
URL: https://github.com/apache/flink/pull/18539


   ## What is the purpose of the change
   
   Incremental savepoints do not reuse sst files from previous checkpoints;
   for now, they re-upload those files. Moreover, they do not register
   their files as reusable by future checkpoints. Lastly, all sst files are
   created in the EXCLUSIVE scope with relative paths, which makes the
   savepoint relocatable.
   
   In order to support CLAIM mode for such savepoints, if a CLAIMed
   snapshot contains shared files, we delay deleting its exclusive
   directory until the shared files coming from that snapshot are no
   longer used.
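
   The delayed deletion described above can be pictured with a minimal
   sketch. This is not the code from this pull request and not Flink's
   `SharedStateRegistry`; the class `ClaimedSnapshotTracker` and all of its
   methods are hypothetical names, and deletion of the exclusive directory
   is only recorded in a list instead of touching a file system.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

/** Hypothetical illustration only; names and structure are assumptions. */
class ClaimedSnapshotTracker {
    // Snapshot id -> shared files of that snapshot still used by later checkpoints.
    private final Map<Long, Set<String>> filesInUse = new HashMap<>();
    // Snapshots that were subsumed while some of their shared files were still in use.
    private final Set<Long> pendingDeletion = new HashSet<>();
    // Snapshots whose exclusive directory has been "deleted" (stand-in for real I/O).
    private final List<Long> deleted = new ArrayList<>();

    /** Remember which shared files of a CLAIMed snapshot are still referenced. */
    void registerClaimed(long snapshotId, Set<String> sharedFiles) {
        filesInUse.put(snapshotId, new HashSet<>(sharedFiles));
    }

    /** The snapshot is no longer needed; delete now or defer if files are in use. */
    void subsume(long snapshotId) {
        Set<String> files = filesInUse.get(snapshotId);
        if (files == null || files.isEmpty()) {
            deleteExclusiveDir(snapshotId);
        } else {
            pendingDeletion.add(snapshotId);
        }
    }

    /** A later checkpoint stopped using one shared file of the given snapshot. */
    void releaseFile(long snapshotId, String file) {
        Set<String> files = filesInUse.get(snapshotId);
        if (files == null) {
            return;
        }
        files.remove(file);
        // Only a snapshot that was already subsumed is deleted here.
        if (files.isEmpty() && pendingDeletion.remove(snapshotId)) {
            deleteExclusiveDir(snapshotId);
        }
    }

    private void deleteExclusiveDir(long snapshotId) {
        filesInUse.remove(snapshotId);
        deleted.add(snapshotId);
    }

    List<Long> deletedSnapshots() {
        return deleted;
    }
}
```

   In this sketch, calling `subsume` on a snapshot with two shared files
   still in use defers the deletion, and the exclusive directory is only
   recorded as deleted once `releaseFile` has been called for both files.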
   
   ## Verifying this change
   
   Added tests:
   * SavepointFormatITCase
   * SharedStateRegistryTest
   
   ## Does this pull request potentially affect one of the following parts:
   
     - Dependencies (does it add or upgrade a dependency): (yes / **no**)
     - The public API, i.e., is any changed class annotated with `@Public(Evolving)`: (yes / **no**)
     - The serializers: (yes / **no** / don't know)
     - The runtime per-record code paths (performance sensitive): (yes / **no** / don't know)
     - Anything that affects deployment or recovery: JobManager (and its components), Checkpointing, Kubernetes/Yarn, ZooKeeper: (**yes** / no / don't know)
     - The S3 file system connector: (yes / **no** / don't know)
   
   ## Documentation
   
     - Does this pull request introduce a new feature? (**yes** / no)
     - If yes, how is the feature documented? (not applicable / docs / **JavaDocs** / not documented)
   


