[
https://issues.apache.org/jira/browse/SPARK-7440?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Reynold Xin updated SPARK-7440:
-------------------------------
Summary: Remove distinct operator in favor of Aggregate (was: Binary
processing for SQL Distinct operator)
> Remove distinct operator in favor of Aggregate
> ----------------------------------------------
>
> Key: SPARK-7440
> URL: https://issues.apache.org/jira/browse/SPARK-7440
> Project: Spark
> Issue Type: New Feature
> Components: SQL
> Reporter: Reynold Xin
>
> We can either just rewrite distinct using groupby (i.e. aggregate operator),
> or rewrite it to use the lower level hash/sort.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]