Xiaogang Shi created FLINK-5036:
-----------------------------------
Summary: Perform the grouping of keys in restoring instead of
checkpointing
Key: FLINK-5036
URL: https://issues.apache.org/jira/browse/FLINK-5036
Project: Flink
Issue Type: Bug
Components: State Backends, Checkpointing
Reporter: Xiaogang Shi
Whenever taking snapshots of {{RocksDBKeyedStateBackend}}, the values in the
states will be written onto different files according to their key groups. The
procedure is very costly when the states are very big.
Given that the snapshot operations will be performed much more frequently than
restoring, we can leave the key groups as they are to improve the overall
performance. In other words, we can perform the grouping of keys in restoring
instead of in checkpointing.
I think, the implementation will be very similar to the restoring of
non-partitioned states. Each task will receive a collection of snapshots each
of which contains a set of key groups. Each task will restore its states from
the given snapshots by picking values in assigned key groups.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)