Michael Armbrust created SPARK-10380:
----------------------------------------

             Summary: Confusing examples in pyspark SQL docs
                 Key: SPARK-10380
                 URL: https://issues.apache.org/jira/browse/SPARK-10380
             Project: Spark
          Issue Type: Bug
          Components: SQL
            Reporter: Michael Armbrust


There’s an error in the astype() documentation, as it uses cast instead of 
astype. It should probably include a mention that astype is an alias for cast 
(and vice versa in the cast documentation): 
https://spark.apache.org/docs/latest/api/python/pyspark.sql.html#pyspark.sql.Column.astype
 

The same error occurs with drop_duplicates and dropDuplicates: 
https://spark.apache.org/docs/latest/api/python/pyspark.sql.html#pyspark.sql.DataFrame.drop_duplicates
 

The issue here is we are copying the code.  According to [~davies] the easiest 
way is to copy the method and just add new docs.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to