LiuJi created FLINK-11425:
-----------------------------
Summary: Support of “Hash Teams” in Hybrid Hash Join
Key: FLINK-11425
URL: https://issues.apache.org/jira/browse/FLINK-11425
Project: Flink
Issue Type: New Feature
Components: Core, Optimizer
Reporter: LiuJi
Hybrid Hash Join is already supported in the current version. The join starts
operating in memory and gradually spills its contents to disk when memory is
insufficient.
The current hash join supports only two inputs, so when a job contains multiple
hash joins that share the same join keys, it consumes unnecessary resources
(I/O, memory, etc.) because some of the data output by an upstream join may be
useless to the downstream hash join.
Based on these observations, we want to provide a HashTeamManager that
implements a multiway-input hash join by combining several two-way hash joins
that have the same join keys. The HashTeamManager manages the relations between
multiple HashTables; by joining multiple relations at one time, it uses memory
more efficiently and reduces I/O operations.
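To make the idea concrete, here is a minimal in-memory sketch of a multiway
join on a shared key. It is only an illustration under stated assumptions, not
the proposed HashTeamManager: the names HashTeamSketch, Row, build and joinTeam
are hypothetical, and spilling to disk is left out entirely. One hash table is
built per build-side input, and each probe row is checked against all of them
in a single pass, so a row that misses in any table is dropped before any
intermediate join result is materialized.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration only; this is not Flink's API and does no spilling.
final class HashTeamSketch {

    /** A row reduced to its join key and a payload string. */
    static final class Row {
        final int key;
        final String value;

        Row(int key, String value) {
            this.key = key;
            this.value = value;
        }
    }

    /** Builds one hash table (key -> payloads) for a build-side input. */
    static Map<Integer, List<String>> build(List<Row> rows) {
        Map<Integer, List<String>> table = new HashMap<>();
        for (Row row : rows) {
            table.computeIfAbsent(row.key, k -> new ArrayList<>()).add(row.value);
        }
        return table;
    }

    /** Joins the probe input against every build input on the shared key in one pass. */
    static List<String> joinTeam(List<Row> probe, List<List<Row>> buildInputs) {
        List<Map<Integer, List<String>>> tables = new ArrayList<>();
        for (List<Row> input : buildInputs) {
            tables.add(build(input));
        }

        List<String> out = new ArrayList<>();
        for (Row row : probe) {
            List<String> partial = new ArrayList<>();
            partial.add(row.key + ":" + row.value);
            for (Map<Integer, List<String>> table : tables) {
                List<String> matches = table.get(row.key);
                if (matches == null) {
                    // No partner in this table: the row cannot appear in the result.
                    partial = new ArrayList<>();
                    break;
                }
                // Extend every partial result with every match from this table.
                List<String> next = new ArrayList<>();
                for (String p : partial) {
                    for (String m : matches) {
                        next.add(p + "," + m);
                    }
                }
                partial = next;
            }
            out.addAll(partial);
        }
        return out;
    }

    public static void main(String[] args) {
        List<Row> probe = Arrays.asList(new Row(1, "p1"), new Row(3, "p3"));
        List<Row> a = Arrays.asList(new Row(1, "a1"), new Row(1, "a2"), new Row(2, "a3"));
        List<Row> b = Arrays.asList(new Row(1, "b1"));
        // Prints [1:p1,a1,b1, 1:p1,a2,b1]
        System.out.println(joinTeam(probe, Arrays.asList(a, b)));
    }
}
```

With two cascaded two-way joins, the first join would emit its full result
before the second join could discard rows without a partner. In the sketch,
that filtering happens while the probe row is still in hand.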