[
https://issues.apache.org/jira/browse/PHOENIX-3131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15403696#comment-15403696
]
tu nguyen khac edited comment on PHOENIX-3131 at 8/2/16 10:01 AM:
------------------------------------------------------------------
It 's about 80k distinct urls , tks for your support , Is 80k is large enough
to make this query too slow in sort step ? In a short java code : we can sort
80k numbers only in a about 0.3 second duration
was (Author: tuyuri):
it 's about 80k distinct urls , tks for your support , is 80k is large enough
to make this query too slow in sort step
> improve "order by " performance with aggregated query
> ------------------------------------------------------
>
> Key: PHOENIX-3131
> URL: https://issues.apache.org/jira/browse/PHOENIX-3131
> Project: Phoenix
> Issue Type: Improvement
> Affects Versions: 4.8.0
> Reporter: tu nguyen khac
> Priority: Critical
>
> I created a table in phoenix with query : ( 4 node , ram 8gb, 4 cores / node
> )
> CREATE TABLE pageview_site (
> url varchar(255) not null,
> pageview bigint,
> dt date not null,
> CONSTRAINT PK PRIMARY KEY (url, dt ROW_TIMESTAMP)
> ) SALT_BUCKETS = 4;
> After that :
> 1. I tried to upsert about : 13 milions rows to this table .
> 2. Run 2 queries :
> a. select url,sum(pageview) as pv FROM pageview_site where dt > to_date
> ('2016-06-01') group by url limit 100 offset 2;
> the duration this query in about : 0.5 second
> b. select url,sum(pageview) as pv FROM pageview_site where dt > to_date
> ('2016-06-01') group by ur order by pv descl limit 100 offset 2;
> the duration this query in about : 9.5 seconds
> what happens with 2nd query ?? I think we should improve performance for
> "order by " command
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)