[
https://issues.apache.org/jira/browse/FLINK-15507?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17119283#comment-17119283
]
Zakelly Lan commented on FLINK-15507:
-------------------------------------
Thanks, [~liyu]. It's a good point for discussion.
Firstly, I think we should also enable the incremental checkpoint by default.
Thus, with default settings, user could leverage the benefit from local
recovery with incremental checkpoint in RocksDB backend, and only a little
additional disk space is consumed. On the other hand, since users choose to use
the RocksDB backend with large state, they should make sure there is enough
local disk space for it. With small disk and large state, RocksDB backend may
not work properly. The task-local full checkpoints aggravate the lack of disk
space, but not the root cause for it. As far as I understand, Flink only keep
the task-local state of the latest checkpoint, so the total disk usage is under
control.
Secondly, I think it's better not to divide the uniform local recovery
configuration into several parts, or to implement different default behaviors
for different backends. Because it may bring confusion to users.
What do you think? [~sewen] [~yunta] [~klion26]
> Activate local recovery for RocksDB backends by default
> -------------------------------------------------------
>
> Key: FLINK-15507
> URL: https://issues.apache.org/jira/browse/FLINK-15507
> Project: Flink
> Issue Type: Improvement
> Components: Runtime / State Backends
> Reporter: Stephan Ewen
> Assignee: Zakelly Lan
> Priority: Blocker
> Labels: pull-request-available
> Fix For: 1.11.0
>
>
> For the RocksDB state backend, local recovery has no overhead when
> incremental checkpoints are used.
> It should be activated by default, because it greatly helps with recovery.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)