[ 
https://issues.apache.org/jira/browse/SPARK-14088?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Reynold Xin updated SPARK-14088:
--------------------------------
    Description: 
1. Deprecated unionAll. It is pretty confusing to have both "union" and 
"unionAll" when the two do the same thing in Spark but are different in SQL.

2. Rename reduce in KeyValueGroupedDataset to reduceGroups so it is more 
consistent with rest of the functions in KeyValueGroupedDataset. Also makes it 
more obvious what "reduce" and "reduceGroups" mean. Previously it was confusing 
because it could be reducing a Dataset, or just reducing groups.

3. Added a "name" function, which is more natural to name columns than "as" for 
non-SQL users.

4. Remove "subtract" function since it is just an alias for "except".





  was:
1. Deprecated unionAll. It is pretty confusing to have both "union" and 
"unionAll" when the two do the same thing in Spark but are different in SQL.

2. Rename reduce in KeyValueGroupedDataset to reduceGroups so it is more 
consistent with rest of the functions in KeyValueGroupedDataset. Also makes it 
more obvious what "reduce" and "reduceGroups" mean. Previously it was confusing 
because it could be reducing a Dataset, or just reducing groups.

3. Added a "name" function, which is more natural to name columns than "as" for 
non-SQL users.






> Some Dataset API touch-up
> -------------------------
>
>                 Key: SPARK-14088
>                 URL: https://issues.apache.org/jira/browse/SPARK-14088
>             Project: Spark
>          Issue Type: Sub-task
>          Components: SQL
>            Reporter: Reynold Xin
>            Assignee: Reynold Xin
>             Fix For: 2.0.0
>
>
> 1. Deprecated unionAll. It is pretty confusing to have both "union" and 
> "unionAll" when the two do the same thing in Spark but are different in SQL.
> 2. Rename reduce in KeyValueGroupedDataset to reduceGroups so it is more 
> consistent with rest of the functions in KeyValueGroupedDataset. Also makes 
> it more obvious what "reduce" and "reduceGroups" mean. Previously it was 
> confusing because it could be reducing a Dataset, or just reducing groups.
> 3. Added a "name" function, which is more natural to name columns than "as" 
> for non-SQL users.
> 4. Remove "subtract" function since it is just an alias for "except".



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to