Wan Kun created SPARK-43876:
-------------------------------

             Summary: Enable fast hashmap for distinct queries
                 Key: SPARK-43876
                 URL: https://issues.apache.org/jira/browse/SPARK-43876
             Project: Spark
          Issue Type: Improvement
          Components: SQL
    Affects Versions: 3.5.0
            Reporter: Wan Kun


Spark will enable fast hash map for primitive data types in HashAggregateExec.

Could we also enable this for distinct queries which bufferSchema is empty.

For example, we can also build a fast hash map with the key a + b for query
{code:sql} SELECT distinct a, b from tab {code}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to