[
https://issues.apache.org/jira/browse/FLINK-25774?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Yingjie Cao updated FLINK-25774:
--------------------------------
Summary: Restrict the maximum number of buffers can be used per result
partition for sort-shuffle (was: Restrict the maximum number of buffers can be
used per result partition for blocking shuffle)
> Restrict the maximum number of buffers can be used per result partition for
> sort-shuffle
> ----------------------------------------------------------------------------------------
>
> Key: FLINK-25774
> URL: https://issues.apache.org/jira/browse/FLINK-25774
> Project: Flink
> Issue Type: Sub-task
> Components: Runtime / Network
> Reporter: Yingjie Cao
> Priority: Major
> Fix For: 1.15.0
>
>
> Currently, for blocking shuffle, the maximum number of buffers can be used
> per result partition is Integer.MAX_VALUE. For hash-shuffle, the maximum
> number of buffers to be used is (numSubpartition + 1), because the
> hash-shuffle implementation always flush the previous buffer after a new
> buffer is added, so setting the maximum number of buffers can be used to
> Integer.MAX_VALUE is meaningless. For sort-shuffle, if too many buffers are
> taken by one result partition, other result partitions and input gates may
> spend too much time waiting for buffers which can influence performance. This
> ticket aims to restrict the maximum number of buffers can be used per result
> partition and the selected value is an empirical one based on the TPC-DS test
> results.
--
This message was sent by Atlassian Jira
(v8.20.1#820001)