caican created SPARK-44426:
------------------------------
Summary: optimize adaptive skew join for ExistenceJoin
Key: SPARK-44426
URL: https://issues.apache.org/jira/browse/SPARK-44426
Project: Spark
Issue Type: Improvement
Components: SQL
Affects Versions: 3.4.0, 3.3.0, 3.2.0, 3.1.2
Reporter: caican
For this query, InSubQuery would be cast to `ExistenceJoin` and now
`ExistenceJoin` does not support automatic data skew for the left table.
{code:java}
SELECT * FROM skewData1
where
(key1 in (select key2 from skewData2)
or value1 in (select value2 from skewData2){code}
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]