Michael Armbrust created SPARK-10380:
----------------------------------------
Summary: Confusing examples in pyspark SQL docs
Key: SPARK-10380
URL: https://issues.apache.org/jira/browse/SPARK-10380
Project: Spark
Issue Type: Bug
Components: SQL
Reporter: Michael Armbrust
There’s an error in the astype() documentation, as it uses cast instead of
astype. It should probably include a mention that astype is an alias for cast
(and vice versa in the cast documentation):
https://spark.apache.org/docs/latest/api/python/pyspark.sql.html#pyspark.sql.Column.astype
The same error occurs with drop_duplicates and dropDuplicates:
https://spark.apache.org/docs/latest/api/python/pyspark.sql.html#pyspark.sql.DataFrame.drop_duplicates
The issue here is we are copying the code. According to [~davies] the easiest
way is to copy the method and just add new docs.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]