Apache Spark commented on SPARK-17997:

User 'wzhfy' has created a pull request for this issue:

> Aggregation function for counting distinct values for multiple intervals
> ------------------------------------------------------------------------
>                 Key: SPARK-17997
>                 URL: https://issues.apache.org/jira/browse/SPARK-17997
>             Project: Spark
>          Issue Type: New Feature
>          Components: SQL
>    Affects Versions: 2.1.0
>            Reporter: Zhenhua Wang
> This is for computing the number of distinct values (NDV) of each bin in an
> equi-height histogram. A bin consists of two endpoints, which define an
> interval of values, and the NDV within that interval. To compute histogram
> statistics, once the endpoints are known, we need an aggregate function that
> counts the distinct values in each interval.

This message was sent by Atlassian JIRA
