[jira] [Commented] (KUDU-2380) Selective predicates when selecting high number of columns burns CPU in SerializeRowBlock

Todd Lipcon (JIRA) Tue, 11 Jun 2019 12:02:15 -0700


    [ 
https://issues.apache.org/jira/browse/KUDU-2380?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16861361#comment-16861361
 ]


Todd Lipcon commented on KUDU-2380:
-----------------------------------

It seems that 7201b0635a448cd244dc598158b13226f14aa0fb improved this for the 
case where an entire rowblock is eliminated (either by deletions or by not 
matching a predicate). I tested COUNT(*) vs * on a table using a predicate with 
no matches and found that KUDU-2381 was responsible for the main difference 
between the two.



> Selective predicates when selecting high number of columns burns CPU in 
> SerializeRowBlock
> -----------------------------------------------------------------------------------------
>
>                 Key: KUDU-2380
>                 URL: https://issues.apache.org/jira/browse/KUDU-2380
>             Project: Kudu
>          Issue Type: Improvement
>          Components: perf, tserver
>            Reporter: Todd Lipcon
>            Assignee: Todd Lipcon
>            Priority: Major
>         Attachments: pprof.kudu-tserver.samples.cpu.001.pb.gz
>
>
> Testing a table with 280 columns, I found the following performance 
> characteristic:
> - scanning all 280 columns with a selective non-key predicate which matches 0 
> rows took 8.28s
> - scanning no columns (count query) with the same predicate which matches 0 
> rows took 314ms.
> This suggests that we are burning 96% of our CPU doing useless work for this 
> query.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

[jira] [Commented] (KUDU-2380) Selective predicates when selecting high number of columns burns CPU in SerializeRowBlock

Reply via email to