[
https://issues.apache.org/jira/browse/HIVE-23031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17060299#comment-17060299
]
Slim Bouguerra commented on HIVE-23031:
---------------------------------------
I do not see how this can work well given the followings:
One sketches return an approximate and user want exact reporting.
Second how you will be mapping the sketching implementation to actual execution
given that there is multiple sketches algorithms
Finally each sketch algorithm has some parameters like number of buckets etc
how are you gonna allow the user to inject that.
In a nutshell am saying let's treat whatever sketch you have in mind as a UDF
and maybe add some as defaults udf that are trusted by the system.
> Add option to enable transparent rewrite of count(distinct) into sketch
> functions
> ---------------------------------------------------------------------------------
>
> Key: HIVE-23031
> URL: https://issues.apache.org/jira/browse/HIVE-23031
> Project: Hive
> Issue Type: Sub-task
> Reporter: Zoltan Haindrich
> Assignee: Zoltan Haindrich
> Priority: Major
>
--
This message was sent by Atlassian Jira
(v8.3.4#803005)