Piotr Nowojski created FLINK-25688:
--------------------------------------
Summary: Resolve performance degradation with high parallelism
when using buffer debloating
Key: FLINK-25688
URL: https://issues.apache.org/jira/browse/FLINK-25688
Project: Flink
Issue Type: Improvement
Components: Runtime / Network
Affects Versions: 1.14.3, 1.15.0
Reporter: Piotr Nowojski
As documented in FLINK-25646, currently buffer debloating in Flink, at least in
the default configuration, has quite noticeable performance degradation at
larger scale. For example throughput can drop by a factor of 4, or even
checkpointing times can be increased. Currently it's not clear why is this
happening. It looks like increasing the number of buffers per channel from the
default ~2 to above 3 (for example via bumping number of floating buffers to
value equal or higher then parallelism), seems to be solving this problem, at
least on one cluster where buffer debloating has been tested at large scale.
Further investigation is required.
CC [~akalashnikov]
--
This message was sent by Atlassian Jira
(v8.20.1#820001)