Hyukjin Kwon created SPARK-21658:
------------------------------------

             Summary: Adds the default None for value in na.replace in PySpark 
to match
                 Key: SPARK-21658
                 URL: https://issues.apache.org/jira/browse/SPARK-21658
             Project: Spark
          Issue Type: Bug
          Components: PySpark
    Affects Versions: 2.2.0
            Reporter: Hyukjin Kwon
            Priority: Minor


Looks {{na.replace}} missed the default value {{None}}.

Both docs says they are aliases 
http://spark.apache.org/docs/2.2.0/api/python/pyspark.sql.html#pyspark.sql.DataFrame.replace
http://spark.apache.org/docs/2.2.0/api/python/pyspark.sql.html#pyspark.sql.DataFrameNaFunctions.replace

but the default values looks different, which ends up with:

{code}
>>> df = spark.createDataFrame([('Alice', 10, 80.0)])
>>> df.replace({"Alice": "a"}).first()
Row(_1=u'a', _2=10, _3=80.0)
>>> df.na.replace({"Alice": "a"}).first()
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
TypeError: replace() takes at least 3 arguments (2 given)
{code}

To take the advantage of SPARK-19454, sounds we should match them.





--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to