Piotr Nowojski created FLINK-17122:
--------------------------------------
Summary: Support InputSelectable and BoundedMultiInput operators
with checkpointing
Key: FLINK-17122
URL: https://issues.apache.org/jira/browse/FLINK-17122
Project: Flink
Issue Type: Wish
Components: Runtime / Checkpointing, Runtime / Network
Affects Versions: 1.10.0, 1.9.2
Reporter: Piotr Nowojski
Currently when user defined some {{InputSelectable}} or {{BoundedMultiInput}}
operators, checkpointing is not supported. Main problem is the that combination
of {{InputSelectable}} and barrier alignment can lead to deadlocks (checkpoint
barrier stuck on not selected channel).
Problem could be somehow mitigated via unaligned checkpoints (FLINK-14551), but
not fully. Even with unaligned checkpoints, checkpoint barriers can be stuck in
the job graph if there is a {{flatMap}} operator (or non {{flatMap}} operator
but if records are spanning multiple buffers), blocked in the middle of
processing by some down stream input selection. In such case we are not able to
perform unaligned checkpoint.
Potential solution could be using persistent communication channels or
detecting before mentioned situations and avoid the dead lock by spilling
excess data.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)