Zhenhua Wang created SPARK-17997:

             Summary: Aggregation function for counting distinct values for 
multiple intervals
                 Key: SPARK-17997
                 URL: https://issues.apache.org/jira/browse/SPARK-17997
             Project: Spark
          Issue Type: New Feature
          Components: SQL
    Affects Versions: 2.1.0
            Reporter: Zhenhua Wang

This is for computing ndv's for bins in equi-height histograms. A bin consists 
of two endpoints which form an interval of values and the ndv in that interval. 
For computing histogram statistics, after getting the endpoints, we need an agg 
function to count distinct values in each interval.

This message was sent by Atlassian JIRA

To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to