[ 
https://issues.apache.org/jira/browse/ARROW-14035?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17418866#comment-17418866
 ] 

Ian Cook edited comment on ARROW-14035 at 9/22/21, 11:12 PM:
-------------------------------------------------------------

{quote}1. Do we need to compute the same thing of hash_count_distinct but 
without using the hash table from the hash group?
{quote}
Yes. [~lidavidm] if I am missing any nuance here, please let me know :)
{quote}Are we going to offer non hash version for all hash_x functions too? 
(hash_distinct, hash_count, hash_sum)
{quote}
Yes I think we should aim for that (or nearly that; there might be a few 
exceptions where it does not make sense.) Comparing the lists of aggregation 
functions and hash (grouped) aggregation functions in 
[compute.rst|https://github.com/apache/arrow/blob/master/docs/source/cpp/compute.rst],
 they are mostly the same already, with just a few differences. I think this 
issue and ARROW-13309 are the most important two additions to bring these two 
lists closer to parity.


was (Author: icook):
{quote}1. Do we need to compute the same thing of hash_count_distinct but 
without using the hash table from the hash group?
{quote}
Yes
{quote}Are we going to offer non hash version for all hash_x functions too? 
(hash_distinct, hash_count, hash_sum)
{quote}
Yes I think we should aim for that (or nearly that; there might be a few 
exceptions where it does not make sense.) Comparing the lists of aggregation 
functions and hash (grouped) aggregation functions in 
[compute.rst|https://github.com/apache/arrow/blob/master/docs/source/cpp/compute.rst],
 they are mostly the same already, with just a few differences. I think this 
issue and ARROW-13309 are the most important two additions to bring these two 
lists closer to parity.

> [C++][Compute] Implement non-hash count_distinct aggregate kernel
> -----------------------------------------------------------------
>
>                 Key: ARROW-14035
>                 URL: https://issues.apache.org/jira/browse/ARROW-14035
>             Project: Apache Arrow
>          Issue Type: New Feature
>          Components: C++
>            Reporter: Ian Cook
>            Assignee: Percy Camilo Triveño Aucahuasi
>            Priority: Critical
>              Labels: kernel
>             Fix For: 6.0.0
>
>
> ARROW-12728 added a {{hash_count_distinct}} hash aggregate kernel, but there 
> is no non-hash {{count_distinct}} aggregate kernel.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to