[
https://issues.apache.org/jira/browse/SPARK-36082?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18078534#comment-18078534
]
Jay Bharadwaj commented on SPARK-36082:
---------------------------------------
Opened a PR [here|https://github.com/apache/spark/pull/55678]
> when the right side is small enough to use SingleColumn Null Aware Anti Join
> ----------------------------------------------------------------------------
>
> Key: SPARK-36082
> URL: https://issues.apache.org/jira/browse/SPARK-36082
> Project: Spark
> Issue Type: Improvement
> Components: SQL
> Affects Versions: 3.2.0, 3.1.3
> Reporter: mcdull_zhang
> Priority: Minor
> Labels: pull-request-available
>
> NULL-aware ANTI join (https://issues.apache.org/jira/browse/SPARK-32290) will
> build right side into a HashMap.
> code in SparkStrategy:
>
> {code:java}
> case j @ ExtractSingleColumnNullAwareAntiJoin(leftKeys, rightKeys) =>
> Seq(joins.BroadcastHashJoinExec(leftKeys, rightKeys, LeftAnti, BuildRight,
> None, planLater(j.left), planLater(j.right), isNullAwareAntiJoin =
> true)){code}
> we should add the conditions and use this optimization when the size of the
> right side is small enough.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]