[ 
https://issues.apache.org/jira/browse/HIVE-2597?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Phabricator updated HIVE-2597:
------------------------------

    Attachment: HIVE-2597.D8967.1.patch

navis requested code review of "HIVE-2597 [jira] Repeated key in GROUP BY is 
erroneously displayed when using DISTINCT".

Reviewers: JIRA

HIVE-2597 Repeated key in GROUP BY is erroneously displayed when using DISTINCT

The following query was simplified for illustration purposes.

This works correctly:
select client_tid, "" as myvalue1, "" as myvalue2 from clients cluster by 
client_tid

The intent here is to produce two empty columns in between data.

The following query does not work:
select distinct client_tid, "" as myvalue1, "" as myvalue2 from clients cluster 
by client_tid

FAILED: Error in semantic analysis: Line 1:44 Repeated key in GROUP BY ""

The key is not repeated since the aliases were given. Seems like Hive is 
ignoring the aliases when the "distinct" keyword is specified.

TEST PLAN
  EMPTY

REVISION DETAIL
  https://reviews.facebook.net/D8967

AFFECTED FILES
  ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java
  ql/src/test/queries/clientpositive/groupby_constant.q
  ql/src/test/results/clientpositive/groupby_constant.q.out

MANAGE HERALD RULES
  https://reviews.facebook.net/herald/view/differential/

WHY DID I GET THIS EMAIL?
  https://reviews.facebook.net/herald/transcript/21711/

To: JIRA, navis

                
> Repeated key in GROUP BY is erroneously displayed when using DISTINCT
> ---------------------------------------------------------------------
>
>                 Key: HIVE-2597
>                 URL: https://issues.apache.org/jira/browse/HIVE-2597
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Alex Rovner
>            Assignee: Navis
>         Attachments: HIVE-2597.D8967.1.patch
>
>
> The following query was simplified for illustration purposes. 
> This works correctly:
> select client_tid, "" as myvalue1, "" as myvalue2 from clients cluster by 
> client_tid
> The intent here is to produce two empty columns in between data.
> The following query does not work:
> select distinct client_tid, "" as myvalue1, "" as myvalue2 from clients 
> cluster by client_tid
> FAILED: Error in semantic analysis: Line 1:44 Repeated key in GROUP BY ""
> The key is not repeated since the aliases were given. Seems like Hive is 
> ignoring the aliases when the "distinct" keyword is specified.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to