Stefan Richter created FLINK-6484:
-------------------------------------

             Summary: Send only the registry keys for already registered files 
in incremental checkpointing
                 Key: FLINK-6484
                 URL: https://issues.apache.org/jira/browse/FLINK-6484
             Project: Flink
          Issue Type: Improvement
          Components: State Backends, Checkpointing
            Reporter: Stefan Richter


State handles for files that are (potentially) shared across multiple 
incremental snapshots are registered under an explicit key in the 
{{SharedStateRegistry}}. Furthermore, the backend is aware which of those files 
are new (unregistered) and which are just references to a previous snapshot 
(already) registered.

The current implementation of incremental checkpoints in RocksDB always 
includes _all_ state handles (unregistered and previously registered) that are 
referenced by the snapshot in the ack message to the checkpoint coordinator, 
assuming that state handles are lightweight pointers.

While this assumption is true in general, there are notable exceptions such as 
the {{ByteStreamStateHandle}}, which pack the actual payload data into the 
state handle. While the maximum capacity for each {{ByteStreamStateHandle}} is 
limited (around 4MB), multiple handles can be part of a snapshot. This makes 
incremental snapshots over {{ByteStreamStateHandle}} essentially not 
incremental and can lead to huge ack messages.

To avoid this issue, I propose that we only send the registration key for all 
previously registered state handles and fetch the corresponding state handle 
from the {{SharedStateRegistry}} in the checkpoint coordinator to insert them 
into the checkpoint data.
 



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Reply via email to