[
https://issues.apache.org/jira/browse/FLINK-19286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Zhu Zhu closed FLINK-19286.
---------------------------
Resolution: Fixed
Done via:
6045da062d30fcaea1b2eaf5fb879fd89b7e920e
f89c137eabb5b61826dac143eca8d804aa99d9f6
247e45214464efc2943cedb9e6766d50171ff236
> Improve pipelined region scheduling performance
> -----------------------------------------------
>
> Key: FLINK-19286
> URL: https://issues.apache.org/jira/browse/FLINK-19286
> Project: Flink
> Issue Type: Sub-task
> Components: Runtime / Coordination
> Affects Versions: 1.12.0
> Reporter: Zhu Zhu
> Assignee: Zhu Zhu
> Priority: Major
> Labels: pull-request-available
> Fix For: 1.12.0
>
>
> In my recent TPCDS benchmark, pipelined region scheduling is slower than
> lazy-from-sources scheduling.
> The regression is due to some suboptimal implementation of
> {{PipelinedRegionSchedulingStrategy}}, including:
> 1. topologically sorting of vertices to deploy
> 2. unnecessary O(V) loop when sorting an empty set of regions
> After improving these implementations, pipelined region scheduling turned to
> be 10% faster in the previous benchmark setup.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)