[jira] [Commented] (SPARK-14088) Some Dataset API touch-up

Apache Spark (JIRA) Tue, 22 Mar 2016 18:23:43 -0700

    [ 
https://issues.apache.org/jira/browse/SPARK-14088?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15207674#comment-15207674
 ]


Apache Spark commented on SPARK-14088:
--------------------------------------

User 'rxin' has created a pull request for this issue:
https://github.com/apache/spark/pull/11908

> Some Dataset API touch-up
> -------------------------
>
>                 Key: SPARK-14088
>                 URL: https://issues.apache.org/jira/browse/SPARK-14088
>             Project: Spark
>          Issue Type: Sub-task
>          Components: SQL
>            Reporter: Reynold Xin
>            Assignee: Reynold Xin
>             Fix For: 2.0.0
>
>
> 1. Deprecated unionAll. It is pretty confusing to have both "union" and 
> "unionAll" when the two do the same thing in Spark but are different in SQL.
> 2. Rename reduce in KeyValueGroupedDataset to reduceGroups so it is more 
> consistent with rest of the functions in KeyValueGroupedDataset. Also makes 
> it more obvious what "reduce" and "reduceGroups" mean. Previously it was 
> confusing because it could be reducing a Dataset, or just reducing groups.
> 3. Added a "name" function, which is more natural to name columns than "as" 
> for non-SQL users.
> 4. Remove "subtract" function since it is just an alias for "except".



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

[jira] [Commented] (SPARK-14088) Some Dataset API touch-up

Reply via email to