[jira] [Updated] (PHOENIX-2965) Use DistinctPrefixFilter logic for COUNT(DISTINCT ...) and COUNT(...) GROUP BY

Lars Hofhansl (JIRA) Sat, 11 Jun 2016 22:07:52 -0700

     [ 
https://issues.apache.org/jira/browse/PHOENIX-2965?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


Lars Hofhansl updated PHOENIX-2965:
-----------------------------------
    Attachment: 2965-v11.txt

-v11 looks to be correct, but misses some optimizations. Simply checks in 
static GroupBy.compile whether there are any HAVING expressions in case of an 
aggregate. If so, it avoid the optimization.

As I said, this is getting fickle. Not a big fan of fragile code, much less a 
fan of adding such code.

We could also just document to use {{SELECT COUNT(...) FROM (SELECT 
DISTINCT(...) FROM T)}} for cases where this is possible.


> Use DistinctPrefixFilter logic for COUNT(DISTINCT ...) and COUNT(...) GROUP BY
> ------------------------------------------------------------------------------
>
>                 Key: PHOENIX-2965
>                 URL: https://issues.apache.org/jira/browse/PHOENIX-2965
>             Project: Phoenix
>          Issue Type: Sub-task
>            Reporter: Lars Hofhansl
>            Assignee: Lars Hofhansl
>             Fix For: 4.8.0
>
>         Attachments: 2965-v10.txt, 2965-v11.txt, 2965-v2.txt, 2965-v3.txt, 
> 2965-v4.txt, 2965-v5.txt, 2965-v6.txt, 2965-v7.txt, 2965-v8.txt, 2965-v9.txt, 
> 2965.txt, PHOENIX-2965_wip.patch
>
>
> Parent uses skip scanning to optimize DISTINCT and certain GROUP BY 
> operations along the row key.
> COUNT queries are optimized differently, could be sped up significantly as 
> well.
> [~giacomotaylor], I might need to help into where COUNT(DISTINCT) queries are 
> planned and optimized.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (PHOENIX-2965) Use DistinctPrefixFilter logic for COUNT(DISTINCT ...) and COUNT(...) GROUP BY

Reply via email to