[ https://issues.apache.org/jira/browse/SPARK-18000?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Reynold Xin closed SPARK-18000. ------------------------------- Resolution: Won't Fix Marking this as won't fix, since it looks like combination of count-min sketch & histogram is strictly better than this. > Aggregation function for computing bins (distinct value, count) pairs for > equi-width histograms > ----------------------------------------------------------------------------------------------- > > Key: SPARK-18000 > URL: https://issues.apache.org/jira/browse/SPARK-18000 > Project: Spark > Issue Type: Sub-task > Components: SQL > Affects Versions: 2.1.0 > Reporter: Zhenhua Wang > > This function computes the count for each distinct non-null value of a column. > It returns: > 1. null if the table is empty or all values of the column are null. > 2. (distinct non-null value, frequency) pairs if the number of distinct > non-null values is less than or equal to the specified threshold. > 3. an empty result if the number of distinct non-null values exceeds that > threshold. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org