[
https://issues.apache.org/jira/browse/SPARK-40986?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
jiaan.geng updated SPARK-40986:
-------------------------------
Summary: Using distinct to reduce the data size for bloom filter (was: Add
extra aggregate on join key for bloom filter)
> Using distinct to reduce the data size for bloom filter
> -------------------------------------------------------
>
> Key: SPARK-40986
> URL: https://issues.apache.org/jira/browse/SPARK-40986
> Project: Spark
> Issue Type: Improvement
> Components: SQL
> Affects Versions: 3.4.0
> Reporter: jiaan.geng
> Priority: Major
>
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]