Sun Rui created SPARK-12235:
-------------------------------
Summary: Enhance mutate() to support replace existing columns
Key: SPARK-12235
URL: https://issues.apache.org/jira/browse/SPARK-12235
Project: Spark
Issue Type: Improvement
Components: SparkR
Affects Versions: 1.5.2
Reporter: Sun Rui
mutate() in the dplyr package supports adding new columns and replacing
existing columns. But currently the implementation of mutate() in SparkR
supports adding new columns only.
Also make the behavior of mutate more consistent with that in dplyr.
1. Throw error message when there are duplicated column names in the DataFrame
being mutated.
2. when there are duplicated column names in specified columns by arguments,
the last column of the same name takes effect.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]