[ 
https://issues.apache.org/jira/browse/HIVE-23031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17060299#comment-17060299
 ] 

Slim Bouguerra commented on HIVE-23031:
---------------------------------------

I do not see how this can work well, given the following:
First, sketches return an approximate result, while users want exact reporting.
Second, how will you map the sketch implementation to the actual execution,
given that there are multiple sketch algorithms?
Finally, each sketch algorithm has its own parameters, such as the number of
buckets; how are you going to allow the user to inject those?

In a nutshell, I am saying let's treat whatever sketch you have in mind as a
UDF, and maybe add some as default UDFs that are trusted by the system.
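
To make the first and last points concrete, here is a minimal Python sketch of
a K-Minimum-Values distinct-count estimator. It is not one of Hive's sketch
functions; the names hash01 and kmv_estimate and the parameter k are
hypothetical, chosen only to show that the answer is approximate and depends
on a parameter somebody has to pick.

```python
import hashlib
import heapq


def hash01(value):
    """Map a value to a pseudo-uniform float in roughly [0, 1)."""
    digest = hashlib.sha256(str(value).encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") / 2**64


def kmv_estimate(values, k):
    """Estimate the number of distinct values from the k smallest hashes.

    Illustration only: a real sketch would keep just k hashes in memory
    instead of materializing the full set of hashes first.
    """
    smallest = heapq.nsmallest(k, {hash01(v) for v in values})
    if len(smallest) < k:
        # Fewer than k distinct hashes were seen, so the count is exact.
        return len(smallest)
    # Standard KMV estimator: (k - 1) divided by the k-th smallest hash.
    return int((k - 1) / smallest[-1])


data = [i % 5000 for i in range(20000)]   # 5000 distinct values
exact = len(set(data))                    # what count(distinct) reports
approx_small_k = kmv_estimate(data, 64)   # coarser estimate
approx_large_k = kmv_estimate(data, 1024) # usually closer to the exact count
print(exact, approx_small_k, approx_large_k)
```

A larger k generally tightens the estimate at the cost of more memory, which is
the kind of knob a transparent rewrite would have to choose on the user's
behalf.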

> Add option to enable transparent rewrite of count(distinct) into sketch 
> functions
> ---------------------------------------------------------------------------------
>
>                 Key: HIVE-23031
>                 URL: https://issues.apache.org/jira/browse/HIVE-23031
>             Project: Hive
>          Issue Type: Sub-task
>            Reporter: Zoltan Haindrich
>            Assignee: Zoltan Haindrich
>            Priority: Major
>