[
https://issues.apache.org/jira/browse/FLINK-12619?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16857585#comment-16857585
]
Yu Li edited comment on FLINK-12619 at 6/6/19 12:11 PM:
--------------------------------------------------------
Hi [~aljoscha], just saw your comments. I agree that both checkpoint and
savepoint controls snapshot, but my 2 cents here: from our
[document|https://ci.apache.org/projects/flink/flink-docs-release-1.8/ops/state/savepoints.html]
savepoint differs from checkpoint just like backups are different from
recovery logs in traditional database systems. And making the solution in a way
of "Add support for choosing the snapshot format for stop-with-savepoint"
sounds like binding the two concepts together and is a little bit confusing. It
sounds like savepoint could take use of the (incremental) checkpoint format but
actually they are in different formats (and if I understand it correctly,
FLIP-41 is trying to unify the savepoint format while allowing different state
backends have their own different checkpoint format). Wdyt? Thanks.
was (Author: carp84):
Hi [~aljoscha], just saw your comments. I agree that both checkpoint and
savepoint controls snapshot, but my 2 cents here: from our
[document|https://ci.apache.org/projects/flink/flink-docs-release-1.8/ops/state/savepoints.html]
savepoint differs from checkpoint just like backups are different from
recovery logs in traditional database systems. And making the solution in a way
of "Add support for choosing the snapshot format for stop-with-savepoint"
sounds like binding the two concepts together and is a little bit confusing.
Wdyt? Thanks.
> Support TERMINATE/SUSPEND Job with Checkpoint
> ---------------------------------------------
>
> Key: FLINK-12619
> URL: https://issues.apache.org/jira/browse/FLINK-12619
> Project: Flink
> Issue Type: New Feature
> Components: Runtime / State Backends
> Reporter: Congxian Qiu(klion26)
> Assignee: Congxian Qiu(klion26)
> Priority: Major
> Labels: pull-request-available
> Time Spent: 10m
> Remaining Estimate: 0h
>
> Inspired by the idea of FLINK-11458, we propose to support terminate/suspend
> a job with checkpoint. This improvement cooperates with incremental and
> external checkpoint features, that if checkpoint is retained and this feature
> is configured, we will trigger a checkpoint before the job stops. It could
> accelarate job recovery a lot since:
> 1. No source rewinding required any more.
> 2. It's much faster than taking a savepoint since incremental checkpoint is
> enabled.
> Please note that conceptually savepoints is different from checkpoint in a
> similar way that backups are different from recovery logs in traditional
> database systems. So we suggest using this feature only for job recovery,
> while stick with FLINK-11458 for the
> upgrading/cross-cluster-job-migration/state-backend-switch cases.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)