[
https://issues.apache.org/jira/browse/SPARK-12235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15048190#comment-15048190
]
Apache Spark commented on SPARK-12235:
--------------------------------------
User 'sun-rui' has created a pull request for this issue:
https://github.com/apache/spark/pull/10220
> Enhance mutate() to support replace existing columns
> ----------------------------------------------------
>
> Key: SPARK-12235
> URL: https://issues.apache.org/jira/browse/SPARK-12235
> Project: Spark
> Issue Type: Improvement
> Components: SparkR
> Affects Versions: 1.5.2
> Reporter: Sun Rui
>
> mutate() in the dplyr package supports adding new columns and replacing
> existing columns. But currently the implementation of mutate() in SparkR
> supports adding new columns only.
> Also make the behavior of mutate more consistent with that in dplyr.
> 1. Throw error message when there are duplicated column names in the
> DataFrame being mutated.
> 2. when there are duplicated column names in specified columns by arguments,
> the last column of the same name takes effect.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]