Riza Suminto created IMPALA-12200:
-------------------------------------
Summary: Cap stats NDV from SetOperationStmt.createMetadata
Key: IMPALA-12200
URL: https://issues.apache.org/jira/browse/IMPALA-12200
Project: IMPALA
Issue Type: Improvement
Components: Frontend
Affects Versions: Impala 4.3.0
Reporter: Riza Suminto
Assignee: Riza Suminto
Union operator will create merged ColumnStats at
SetOperationStmt.createMetadata where it adds all ColumnStats from its input
children. One of the stats being accumulated is NDV (num distinct value). There
is an opportunity to lower the resulting NDV if all source expression is
referring to the same column. This lower NDV can benefit Aggregation node on
top of the Union node because it can lower cardinality and memory estimate of
the Aggregation node.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)