[
https://issues.apache.org/jira/browse/FLINK-26029?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Flink Jira Bot updated FLINK-26029:
-----------------------------------
Labels: extensibility stale-assigned (was: extensibility)
I am the [Flink Jira Bot|https://github.com/apache/flink-jira-bot/] and I help
the community manage its development. I see this issue is assigned but has not
received an update in 30 days, so it has been labeled "stale-assigned".
If you are still working on the issue, please remove the label and add a
comment updating the community on your progress. If this issue is waiting on
feedback, please consider this a reminder to the committer/reviewer. Flink is a
very active project, and so we appreciate your patience.
If you are no longer working on the issue, please unassign yourself so someone
else may work on it.
> Generalize the checkpoint protocol of OperatorCoordinator.
> ----------------------------------------------------------
>
> Key: FLINK-26029
> URL: https://issues.apache.org/jira/browse/FLINK-26029
> Project: Flink
> Issue Type: Improvement
> Components: Runtime / Checkpointing
> Affects Versions: 1.14.3
> Reporter: Jiangjie Qin
> Assignee: Dong Lin
> Priority: Major
> Labels: extensibility, stale-assigned
> Fix For: 1.16.0
>
>
> Currently the JM opens all the event valves from the OperatorCoordinator to
> the subtasks after the checkpoint barriers are sent to the Source subtasks.
> While this works for the Source Operators, it unnecessarily limits general
> usage of the OperatorCoordinator for other operators.
> To generalize the protocol, we can change the JM to open the event valve of
> the subtasks that have finished the local checkpoint. So the protocol would
> become following:
> # Let the OC finish processing all the incoming OperatorEvents before the
> snapshot.
> # Wait until all the outgoing OperatorEvents before the snapshot are sent
> and acked.
> # Shut the event valve so no outgoing events can be sent to the subtasks.
> # Send checkpoint barriers to the Source operators.
> # Open the corresponding event valve of a subtask when the
> AcknowledgeCheckpoint messages from that subtask is received.
--
This message was sent by Atlassian Jira
(v8.20.1#820001)