[jira] [Commented] (ARROW-13309) [C++] Implement hash_aggregate quantile kernel

Ian Cook (Jira) Sun, 01 Aug 2021 12:30:07 -0700


    [ 
https://issues.apache.org/jira/browse/ARROW-13309?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17391230#comment-17391230
 ]


Ian Cook commented on ARROW-13309:
----------------------------------

It's common for distributed query engines to have _approximate_ quantile 
functions. I believe Spark and Impala have _only_ approximate quantile 
functions, no exact versions. Some others like Hive have both, but with 
limitations on the exact version (for example in Hive I believe it only works 
on integer data.) I think it would be wise for us to implement an approximate 
version first: ARROW-13520.

> [C++] Implement hash_aggregate quantile kernel
> ----------------------------------------------
>
>                 Key: ARROW-13309
>                 URL: https://issues.apache.org/jira/browse/ARROW-13309
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: C++
>            Reporter: David Li
>            Priority: Major
>
> Requires ARROW-12759.
> We have a scalar aggregate kernel for this already and hopefully the 
> implementation can be reused.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

[jira] [Commented] (ARROW-13309) [C++] Implement hash_aggregate quantile kernel

Reply via email to