GitHub user tillrohrmann opened a pull request:
https://github.com/apache/flink/pull/5444
[FLINK-8546] [flip6] Respect savepoints and restore from latest checkpoints
## What is the purpose of the change
Let the JobMaster respect checkpoints and savepoints. The JobMaster will
always
try to restore the latest checkpoint if there is one available. Next it
will check
whether savepoint restore settings have been set. If so, then it will try
to restore
the savepoint. Only if these settings are not set, the job will be started
from
scratch.
This PR is based on #5443.
## Brief change log
- Check in JobMaster if the `CheckpointCoordinator` has been set
- If so, then check if there is a checkpoint to recover
- If not, then check whether we can restore from a savepoint
- If not, then start the job without any recovered state
## Verifying this change
- Added `JobMasterTest#testRestoringFromSavepoint` and
`JobMasterTest#testCheckpointPrecedesSavepointRecovery`.
## Does this pull request potentially affect one of the following parts:
- Dependencies (does it add or upgrade a dependency): (no)
- The public API, i.e., is any changed class annotated with
`@Public(Evolving)`: (no)
- The serializers: (no)
- The runtime per-record code paths (performance sensitive): (no)
- Anything that affects deployment or recovery: JobManager (and its
components), Checkpointing, Yarn/Mesos, ZooKeeper: (yes)
- The S3 file system connector: (no)
## Documentation
- Does this pull request introduce a new feature? (no)
- If yes, how is the feature documented? (not applicable)
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/tillrohrmann/flink respectSavepoints
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/flink/pull/5444.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #5444
commit 8e172aee75e9e0c81608ed7e6796bca1ea7d7544
Author: Till Rohrmann
Date: 2018-02-09T13:07:31Z
[FLINK-8626] Introduce BackPressureStatsTracker interface
Renames BackPressureStatsTracker into BackPressureStatsTrackerImpl and
introduce
a BackPressureStatsTracker interface. This will make testing easier when we
don't
have to set up all the different components.
commit 8380146fb2f85d4e8d9d41b84ba0ad435c242984
Author: Till Rohrmann
Date: 2018-02-09T13:18:11Z
[hotfix] [tests] Simplify JobMasterTest
commit 09d36a47ed78d7fae0cae0229823114bbb6d45be
Author: Till Rohrmann
Date: 2018-02-01T15:14:53Z
[FLINK-8546] [flip6] Respect savepoints and restore from latest checkpoints
Let the JobMaster respect checkpoints and savepoints. The JobMaster will
always
try to restore the latest checkpoint if there is one available. Next it
will check
whether savepoint restore settings have been set. If so, then it will try
to restore
the savepoint. Only if these settings are not set, the job will be started
from
scratch.
---