[
https://issues.apache.org/jira/browse/SPARK-23564?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-23564:
------------------------------------
Assignee: (was: Apache Spark)
> the optimized logical plan about Left anti join should be further
> optimization
> -------------------------------------------------------------------------------
>
> Key: SPARK-23564
> URL: https://issues.apache.org/jira/browse/SPARK-23564
> Project: Spark
> Issue Type: Improvement
> Components: SQL
> Affects Versions: 2.3.0
> Reporter: KaiXinXIaoLei
> Priority: Major
>
> The Optimized Logical Plan of the query '*select * from tt1 left anti join
> tt2 on tt2.i = tt1.i*' is
>
> {code:java}
> == Optimized Logical Plan ==
> Join LeftAnti, (i#2 = i#0)
> :- HiveTableRelation `default`.`tt1`,
> org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe, [i#0, s#1]
> +- Project [i#2]
> +- HiveTableRelation `default`.`tt2`,
> org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe, [i#2, s#3]
> {code}
>
>
> this plan can be further optimization by 'Filter isnotnull' of right table,
> as follow:
> {code:java}
> == Optimized Logical Plan ==
> Join LeftAnti, (i#2 = i#0)
> :- HiveTableRelation `default`.`tt1`,
> org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe, [i#0, s#1]
> +- Project [i#2]
> +- Filter isnotnull(i#3)
> +- HiveTableRelation `default`.`tt2`,
> org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe, [i#2, s#3]
> {code}
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]