dawidwys opened a new pull request #18539:
URL: https://github.com/apache/flink/pull/18539
## What is the purpose of the change
Incremental savepoints do not reuse sst files from previous checkpoints.
At this point they re-upload those files. Moreover they do not register
its files as reusable by future checkpoints. Lastly, all sst files are
created in the EXCLUSIVE scope with relative paths, which makes the
savepoint relocatable.
In order to support CLAIM mode for such savepoints, if a CLAIMed
snapshots contain shared files, we delay deleting the exclusive
directory until shared files coming from that snapshot are not used
anymore.
## Verifying this change
Added tests:
* SavepointFormatITCase
* SharedStateRegistryTest
## Does this pull request potentially affect one of the following parts:
- Dependencies (does it add or upgrade a dependency): (yes / **no**)
- The public API, i.e., is any changed class annotated with
`@Public(Evolving)`: (yes / **no**)
- The serializers: (yes / **no** / don't know)
- The runtime per-record code paths (performance sensitive): (yes / **no**
/ don't know)
- Anything that affects deployment or recovery: JobManager (and its
components), Checkpointing, Kubernetes/Yarn, ZooKeeper: (**yes** / no / don't
know)
- The S3 file system connector: (yes / **no** / don't know)
## Documentation
- Does this pull request introduce a new feature? (**yes** / no)
- If yes, how is the feature documented? (not applicable / docs /
**JavaDocs** / not documented)
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]