Nico Kruber created FLINK-20217:
-----------------------------------
Summary: More fine-grained timer processing
Key: FLINK-20217
URL: https://issues.apache.org/jira/browse/FLINK-20217
Project: Flink
Issue Type: Improvement
Components: API / DataStream
Affects Versions: 1.11.2, 1.10.2, 1.12.0
Reporter: Nico Kruber
Timers are currently processed in one big block under the checkpoint lock
(in {{InternalTimerServiceImpl#advanceWatermark}}). This can be problematic
in a number of scenarios during checkpointing and can lead to checkpoints
timing out (even unaligned checkpoints would not help).
If you have a huge number of timers to process when advancing the watermark and
the task is also back-pressured, the situation may actually be worse since you
would block on the checkpoint lock and also wait for buffers/credits from the
receiver.
I propose to make this loop more fine-grained so that it can be interrupted by
checkpoints, but there may also be other ways to improve this.
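To illustrate the idea, here is a minimal sketch of a batched timer loop. It does not use Flink's actual classes: {{FineGrainedTimerSketch}}, {{maxTimersPerBatch}} and the plain Java monitor standing in for the checkpoint lock are all assumptions made for illustration. The point is only that the lock is released between batches, so a checkpoint waiting on it could get a turn before all timers have fired.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.PriorityQueue;

/** Simplified stand-in for a timer service; hypothetical, not Flink's implementation. */
class FineGrainedTimerSketch {
    // Plain monitor standing in for the checkpoint lock (assumption).
    private final Object checkpointLock = new Object();
    // Timers ordered by timestamp, earliest first.
    private final PriorityQueue<Long> timers = new PriorityQueue<>();
    private final List<Long> fired = new ArrayList<>();
    private int lockAcquisitions = 0;

    void registerTimer(long timestamp) {
        synchronized (checkpointLock) {
            timers.add(timestamp);
        }
    }

    /**
     * Fires all timers with timestamp <= watermark, but holds the lock for at most
     * maxTimersPerBatch timers at a time (hypothetical parameter).
     */
    void advanceWatermark(long watermark, int maxTimersPerBatch) {
        if (maxTimersPerBatch < 1) {
            throw new IllegalArgumentException("maxTimersPerBatch must be >= 1");
        }
        boolean more = true;
        while (more) {
            synchronized (checkpointLock) {
                lockAcquisitions++;
                int firedInBatch = 0;
                while (firedInBatch < maxTimersPerBatch
                        && !timers.isEmpty()
                        && timers.peek() <= watermark) {
                    fired.add(timers.poll()); // stands in for the user's onTimer callback
                    firedInBatch++;
                }
                more = !timers.isEmpty() && timers.peek() <= watermark;
            }
            // Lock is released here; a checkpoint waiting on it could run
            // before the next batch of timers is processed.
        }
    }

    List<Long> firedTimers() {
        synchronized (checkpointLock) {
            return new ArrayList<>(fired);
        }
    }

    int pendingTimers() {
        synchronized (checkpointLock) {
            return timers.size();
        }
    }

    int lockAcquisitions() {
        synchronized (checkpointLock) {
            return lockAcquisitions;
        }
    }
}
```

With ten timers at timestamps 1 to 10, calling {{advanceWatermark(7, 3)}} on this sketch fires seven timers across three separate lock acquisitions instead of one. Whether a batch size, a time budget, or some other yield criterion is the right granularity is left open here.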
--
This message was sent by Atlassian Jira
(v8.3.4#803005)