Hi,

I think a) doesn't hold because there is no synchronisation between the 
CheckpointCoordinator and the sources doing the reading. I think b) will hold 
but it's also not exact because of clock differences between different machines 
and whatnot.

Best,
Aljoscha

> On 29. Jan 2018, at 15:34, Juho Autio <juho.au...@rovio.com> wrote:
> 
> I'm triggering nightly savepoints at 23:59:00 with crontab on the flink 
> cluster.
> 
> For example last night's savepoint has this information:
> 
> Trigger Time: 23:59:14
> Latest Acknowledgement: 00:00:59
> 
> What are the min/max boundaries for the data contained by the savepoint? Can 
> I deduce from this either of the following:
> 
> a) the savepoint cannot contain any data that was produced after 23:59:14
> b) the savepoint cannot contain any data that was produced after 00:00:59
> 
> My use case is like this: if I restore the nightly savepoint, I want to be 
> sure that any data that was produced during the current day will be included 
> (+ some data from the previous day, that's ok). If the answer to above 
> question is that (a) is false, but (b) holds, that would mean that I would 
> need to trigger the savepoint early enough for it to complete before the 
> midnight.
> 
> Something from the docs that doesn't seem to answer my question:
> 
> > Trigger Time: The time when the checkpoint was triggered at the JobManager.
> > Latest Acknowledgement: The time when the latest acknowledged for any 
> > subtask was received at the JobManager (or n/a if no acknowledgement 
> > received yet).
> 
> https://ci.apache.org/projects/flink/flink-docs-release-1.4/monitoring/checkpoint_monitoring.html#history-tab
>  
> <https://ci.apache.org/projects/flink/flink-docs-release-1.4/monitoring/checkpoint_monitoring.html#history-tab>

Reply via email to