[ https://issues.apache.org/jira/browse/ARROW-13309?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17391230#comment-17391230 ]
Ian Cook commented on ARROW-13309:
----------------------------------
It's common for distributed query engines to have _approximate_ quantile
functions. I believe Spark and Impala have _only_ approximate quantile
functions, no exact versions. Some others like Hive have both, but with
limitations on the exact version (for example, in Hive I believe it only works
on integer data). I think it would be wise for us to implement an approximate
version first: ARROW-13520.
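
To illustrate why the approximate form tends to suit grouped and distributed
aggregation, here is a minimal sketch in plain Python. It is not Arrow code and
not the algorithm proposed in ARROW-13520: a fixed-bin histogram over an assumed
known value range stands in for a real quantile sketch, and the names
`exact_group_quantile` and `HistogramSketch` are hypothetical. The point is only
that the exact version has to buffer every value per group, while the
approximate one keeps a small fixed-size state that can be merged across
partitions.

```python
from collections import defaultdict
import math


def exact_group_quantile(keys, values, q):
    """Exact per-group quantile: buffers every value of every group."""
    groups = defaultdict(list)
    for k, v in zip(keys, values):
        groups[k].append(v)
    out = {}
    for k, vs in groups.items():
        vs.sort()
        # "Lower" interpolation: take the element at floor(q * (n - 1)).
        out[k] = vs[int(math.floor(q * (len(vs) - 1)))]
    return out


class HistogramSketch:
    """Fixed-size, mergeable summary over an assumed range [lo, hi)."""

    def __init__(self, lo, hi, bins=64):
        self.lo, self.hi, self.bins = lo, hi, bins
        self.counts = [0] * bins
        self.total = 0

    def add(self, v):
        i = int((v - self.lo) * self.bins / (self.hi - self.lo))
        # Clamp out-of-range values into the first or last bin.
        self.counts[min(max(i, 0), self.bins - 1)] += 1
        self.total += 1

    def merge(self, other):
        # Merging is element-wise addition, so partial results from
        # different partitions combine without revisiting the data.
        self.counts = [a + b for a, b in zip(self.counts, other.counts)]
        self.total += other.total

    def quantile(self, q):
        target = q * self.total
        seen = 0
        width = (self.hi - self.lo) / self.bins
        for i, c in enumerate(self.counts):
            if c and seen + c >= target:
                # Interpolate linearly inside the bin that holds the target rank.
                return self.lo + (i + (target - seen) / c) * width
            seen += c
        return self.hi
```

The error of this particular stand-in is bounded by the bin width, and its state
per group is `bins` counters regardless of how many rows the group has. A
production kernel would more likely use a sketch that does not need the value
range up front.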
> [C++] Implement hash_aggregate quantile kernel
> ----------------------------------------------
>
> Key: ARROW-13309
> URL: https://issues.apache.org/jira/browse/ARROW-13309
> Project: Apache Arrow
> Issue Type: Improvement
> Components: C++
> Reporter: David Li
> Priority: Major
>
> Requires ARROW-12759.
> We have a scalar aggregate kernel for this already and hopefully the
> implementation can be reused.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)