[ https://issues.apache.org/jira/browse/HIVE-18008?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16246824#comment-16246824 ]
Andrew Sherman commented on HIVE-18008:
---------------------------------------
I see the same break (just FYI, not piling on :-)).
The new file is in the patch, but it is not in the commit:
{noformat}
[~/git/asf/hive]$ git show --name-only ff3b327d322b04916e019fcec75d3fbd48e26bae
commit ff3b327d322b04916e019fcec75d3fbd48e26bae (HEAD -> master, origin/master, origin/HEAD)
Author: Vineet Garg <[email protected]>
Date: Thu Nov 9 15:54:11 2017 -0800
HIVE-18008 : Add optimization rule to remove gby from right side of left semi-join (Vineet Garg, reviewed by Ashutosh Chauhan)
ql/src/java/org/apache/hadoop/hive/ql/parse/CalcitePlanner.java
ql/src/test/queries/clientpositive/subquery_in.q
ql/src/test/results/clientpositive/llap/subquery_in.q.out
ql/src/test/results/clientpositive/spark/subquery_in.q.out
ql/src/test/results/clientpositive/subquery_unqualcolumnrefs.q.out
{noformat}
> Add optimization rule to remove gby from right side of left semi-join
> ---------------------------------------------------------------------
>
> Key: HIVE-18008
> URL: https://issues.apache.org/jira/browse/HIVE-18008
> Project: Hive
> Issue Type: Improvement
> Components: Query Planning
> Reporter: Vineet Garg
> Assignee: Vineet Garg
> Attachments: HIVE-18008.1.patch, HIVE-18008.2.patch
>
>
> A group by (on the same keys as the semi join) on the right side of a left
> semi join is unnecessary and can be removed. We see this pattern in
> subqueries with an explicit distinct keyword, e.g.
> {code:sql}
> explain select * from src b where b.key in (select distinct key from src a
> where a.value = b.value)
> {code}
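
For illustration only, here is a minimal sketch of why the distinct (group by) on the right side should not change the result of the semi join: an IN subquery only tests for existence, so duplicate keys on the right are irrelevant. It assumes SQLite semantics as a stand-in for Hive, and the tables outer_t and inner_t, their rows, and the function name are hypothetical, not taken from the patch.

```python
import sqlite3


def compare_semi_join_variants():
    """Run the IN subquery with and without DISTINCT and return both results."""
    conn = sqlite3.connect(":memory:")
    # Hypothetical tables standing in for the two roles of `src` in the issue.
    conn.execute('CREATE TABLE outer_t ("key" TEXT, value TEXT)')
    conn.execute('CREATE TABLE inner_t ("key" TEXT, value TEXT)')
    conn.executemany(
        "INSERT INTO outer_t VALUES (?, ?)",
        [("1", "a"), ("2", "b"), ("3", "c"), ("4", "c")],
    )
    # The right side deliberately contains duplicate (key, value) pairs.
    conn.executemany(
        "INSERT INTO inner_t VALUES (?, ?)",
        [("1", "a"), ("1", "a"), ("1", "a"), ("2", "z"), ("3", "c"), ("3", "c")],
    )
    template = (
        'SELECT b."key", b.value FROM outer_t b '
        'WHERE b."key" IN (SELECT {distinct} a."key" FROM inner_t a '
        'WHERE a.value = b.value) ORDER BY b."key"'
    )
    with_distinct = conn.execute(template.format(distinct="DISTINCT")).fetchall()
    without_distinct = conn.execute(template.format(distinct="")).fetchall()
    conn.close()
    return with_distinct, without_distinct


if __name__ == "__main__":
    with_distinct, without_distinct = compare_semi_join_variants()
    print(with_distinct)
    print(without_distinct)
```

Both queries return the same rows here, which is the property the proposed rule relies on. The sketch says nothing about how the rule is implemented in CalcitePlanner.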