[jira] Commented: (HIVE-287) count distinct on multiple columns does not work

John Sichi (JIRA) Fri, 25 Jun 2010 18:02:19 -0700

    [ 
https://issues.apache.org/jira/browse/HIVE-287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12882792#action_12882792
 ]


John Sichi commented on HIVE-287:
---------------------------------

Sorry, my commit for HIVE-1387 just introduced a new conflict, so one more 
rebase is required.

I still don't think that we should allow anything other than COUNT(*) in the 
parser.  SUM(*) should be disallowed, as should COUNT(DISTINCT *), as should 
MY_UDF(*).  They are non-standard, and I think they would just cause confusion 
since COUNT(*) ignores the column values rather than doing anything with them.  
Together with preventing these, add negative tests to verify that they are 
rejected.

Given that, we can get rid of the isAllColumns info.

Also, can you add an EXPLAIN for the COUNT(*) query in the test?

After that, I think we'll be good to go.


> count distinct on multiple columns does not work
> ------------------------------------------------
>
>                 Key: HIVE-287
>                 URL: https://issues.apache.org/jira/browse/HIVE-287
>             Project: Hadoop Hive
>          Issue Type: Bug
>          Components: Query Processor
>            Reporter: Namit Jain
>            Assignee: Arvind Prabhakar
>         Attachments: HIVE-287-1.patch, HIVE-287-2.patch, HIVE-287-3.patch, 
> HIVE-287-4.patch
>
>
> The following query does not work:
> select count(distinct col1, col2) from Tbl

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (HIVE-287) count distinct on multiple columns does not work

Reply via email to