[jira] [Commented] (ARROW-13451) [C++][Compute] Consider removing ScalarAggregateKernel

Ben Kietzman (Jira) Mon, 09 Aug 2021 10:11:04 -0700


    [ 
https://issues.apache.org/jira/browse/ARROW-13451?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17396170#comment-17396170
 ]


Ben Kietzman commented on ARROW-13451:
--------------------------------------

At the same time, it'd be nice to make the helper functions 
{{internal::GetKernels}} etc public and well tested.

See also: https://github.com/apache/arrow/pull/10793#discussion_r685358176

> [C++][Compute] Consider removing ScalarAggregateKernel
> ------------------------------------------------------
>
>                 Key: ARROW-13451
>                 URL: https://issues.apache.org/jira/browse/ARROW-13451
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: C++
>            Reporter: Ben Kietzman
>            Priority: Major
>              Labels: pull-request-available
>          Time Spent: 2h 10m
>  Remaining Estimate: 0h
>
> Scalar aggregation does not incur large memory overhead for the associated 
> KernelState objects, so maybe it'd be acceptable to remove explicit scalar 
> aggregation kernels in favor of reusing grouped aggregation kernels with a 
> single group. This would decrease our maintenance burden significantly, and 
> if the benchmarks don't show a regression for single-group aggregation then 
> there's no reason not to.
> Even if there is a performance regression we could bundle the scalar and 
> grouped aggregate kernels in the same compute::Function and decide between 
> them in Dispatch*, rather than confusingly defining distinct "sum" and 
> "hash_sum" functions



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

[jira] [Commented] (ARROW-13451) [C++][Compute] Consider removing ScalarAggregateKernel

Reply via email to