Sun Rui created SPARK-12235:
-------------------------------

             Summary: Enhance mutate() to support replace existing columns
                 Key: SPARK-12235
                 URL: https://issues.apache.org/jira/browse/SPARK-12235
             Project: Spark
          Issue Type: Improvement
          Components: SparkR
    Affects Versions: 1.5.2
            Reporter: Sun Rui


mutate() in the dplyr package supports adding new columns and replacing 
existing columns. But currently the implementation of mutate() in SparkR 
supports adding new columns only.

Also make the behavior of mutate more consistent with that in dplyr.
1. Throw error message when there are duplicated column names in the DataFrame 
being mutated.
2. when there are duplicated column names in specified columns by arguments, 
the last column of the same name takes effect.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to