Hi, I think a) doesn't hold because there is no synchronisation between the CheckpointCoordinator and the sources doing the reading. I think b) will hold but it's also not exact because of clock differences between different machines and whatnot.
Best, Aljoscha > On 29. Jan 2018, at 15:34, Juho Autio <juho.au...@rovio.com> wrote: > > I'm triggering nightly savepoints at 23:59:00 with crontab on the flink > cluster. > > For example last night's savepoint has this information: > > Trigger Time: 23:59:14 > Latest Acknowledgement: 00:00:59 > > What are the min/max boundaries for the data contained by the savepoint? Can > I deduce from this either of the following: > > a) the savepoint cannot contain any data that was produced after 23:59:14 > b) the savepoint cannot contain any data that was produced after 00:00:59 > > My use case is like this: if I restore the nightly savepoint, I want to be > sure that any data that was produced during the current day will be included > (+ some data from the previous day, that's ok). If the answer to above > question is that (a) is false, but (b) holds, that would mean that I would > need to trigger the savepoint early enough for it to complete before the > midnight. > > Something from the docs that doesn't seem to answer my question: > > > Trigger Time: The time when the checkpoint was triggered at the JobManager. > > Latest Acknowledgement: The time when the latest acknowledged for any > > subtask was received at the JobManager (or n/a if no acknowledgement > > received yet). > > https://ci.apache.org/projects/flink/flink-docs-release-1.4/monitoring/checkpoint_monitoring.html#history-tab > > <https://ci.apache.org/projects/flink/flink-docs-release-1.4/monitoring/checkpoint_monitoring.html#history-tab>