Wan Kun created SPARK-43876:
-------------------------------
Summary: Enable fast hashmap for distinct queries
Key: SPARK-43876
URL: https://issues.apache.org/jira/browse/SPARK-43876
Project: Spark
Issue Type: Improvement
Components: SQL
Affects Versions: 3.5.0
Reporter: Wan Kun
Spark enables the fast hash map for primitive data types in HashAggregateExec.
Could we also enable it for distinct queries, whose bufferSchema is empty?
For example, we could build a fast hash map with a and b as the key for the query:
{code:sql}
SELECT DISTINCT a, b FROM tab
{code}
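To illustrate the idea: a distinct query behaves like a hash aggregation whose map stores only the grouping keys, with no aggregation buffer to update. Below is a minimal Python sketch of that behavior. It is not Spark's generated code, and the function name `distinct_via_hash_map` and the sample rows are hypothetical.

```python
from typing import Dict, Iterable, List, Tuple

Row = Tuple[int, int]  # one (a, b) row with primitive columns


def distinct_via_hash_map(rows: Iterable[Row]) -> List[Row]:
    """Hash aggregation with an empty buffer: only the keys are stored."""
    seen: Dict[Row, None] = {}  # key = (a, b); the value is a placeholder
    for a, b in rows:
        key = (a, b)
        if key not in seen:
            # First occurrence of this key. There is no buffer to
            # initialize or update, so inserting the key is all we do.
            seen[key] = None
    # dict preserves insertion order, so keys come back in first-seen order.
    return list(seen)


rows = [(1, 2), (1, 2), (3, 4), (1, 2), (3, 5)]
result = distinct_via_hash_map(rows)
```

Because nothing is stored besides the key, the lookup-or-insert step is the whole per-row cost, which is why a specialized map for primitive keys looks applicable here.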