[
https://issues.apache.org/jira/browse/SPARK-34681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-34681:
------------------------------------
Assignee: Apache Spark
> Full outer shuffled hash join when building left side produces wrong result
> ---------------------------------------------------------------------------
>
> Key: SPARK-34681
> URL: https://issues.apache.org/jira/browse/SPARK-34681
> Project: Spark
> Issue Type: Bug
> Components: SQL
> Affects Versions: 3.2.0, 3.1.1
> Reporter: Cheng Su
> Assignee: Apache Spark
> Priority: Blocker
>
> For full outer shuffled hash join with building hash map on left side, and
> having non-equal condition, the join can produce wrong result.
> The root cause is `boundCondition` in `HashJoin.scala` always assumes the
> left side row is `streamedPlan` and right side row is `buildPlan`
> (streamedPlan.output ++ buildPlan.output). This is valid assumption, except
> for full outer + build left case.
> The fix is to correct `boundCondition` in `HashJoin.scala` to handle full
> outer + build left case properly. See reproduce in
> https://issues.apache.org/jira/browse/SPARK-32399?focusedCommentId=17298414&page=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#comment-17298414
> .
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]