[ https://issues.apache.org/jira/browse/HIVE-18008?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16246824#comment-16246824 ]
Andrew Sherman commented on HIVE-18008: --------------------------------------- I see the same break, just FYI, not piling on :-) I see the new file in the patch but it is not in the commit; {noformat} [~/git/asf/hive]$ git show --name-only ff3b327d322b04916e019fcec75d3fbd48e26bae commit ff3b327d322b04916e019fcec75d3fbd48e26bae (HEAD -> master, origin/master, origin/HEAD) Author: Vineet Garg <vg...@apache.com> Date: Thu Nov 9 15:54:11 2017 -0800 HIVE-18008 : Add optimization rule to remove gby from right side of left semi-join (Vineet Garg, reviewed by Ashutosh Chauhan) ql/src/java/org/apache/hadoop/hive/ql/parse/CalcitePlanner.java ql/src/test/queries/clientpositive/subquery_in.q ql/src/test/results/clientpositive/llap/subquery_in.q.out ql/src/test/results/clientpositive/spark/subquery_in.q.out ql/src/test/results/clientpositive/subquery_unqualcolumnrefs.q.out {noformat} > Add optimization rule to remove gby from right side of left semi-join > --------------------------------------------------------------------- > > Key: HIVE-18008 > URL: https://issues.apache.org/jira/browse/HIVE-18008 > Project: Hive > Issue Type: Improvement > Components: Query Planning > Reporter: Vineet Garg > Assignee: Vineet Garg > Attachments: HIVE-18008.1.patch, HIVE-18008.2.patch > > > Group by (on same keys as semi join) as right side of Left semi join is > unnecessary and could be removed. We see this pattern in subqueries with > explicit distinct keyword e.g. > {code:sql} > explain select * from src b where b.key in (select distinct key from src a > where a.value = b.value) > {code} -- This message was sent by Atlassian JIRA (v6.4.14#64029)