[
https://issues.apache.org/jira/browse/SPARK-18069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15600830#comment-15600830
]
Apache Spark commented on SPARK-18069:
--------------------------------------
User 'mortada' has created a pull request for this issue:
https://github.com/apache/spark/pull/15053
> Many examples in Python docstrings are incomplete
> -------------------------------------------------
>
> Key: SPARK-18069
> URL: https://issues.apache.org/jira/browse/SPARK-18069
> Project: Spark
> Issue Type: Documentation
> Components: Documentation
> Affects Versions: 2.0.1
> Reporter: Mortada Mehyar
> Priority: Minor
>
> A lot of the python API functions show example usage that is incomplete. The
> docstring shows output without having the input DataFrame defined. It can be
> quite confusing trying to understand and/or follow the example.
> For instance, the docstring for `DataFrame.dtypes()` is currently
> {code}
> def dtypes(self):
> """Returns all column names and their data types as a list.
>
> >>> df.dtypes
> [('age', 'int'), ('name', 'string')]
> """
> {code}
> when it should really be
> {code}
> def dtypes(self):
> """Returns all column names and their data types as a list.
>
> >>> df = spark.createDataFrame([('Alice', 2), ('Bob', 5)], ['name',
> 'age'])
> >>> df.dtypes
> [('age', 'int'), ('name', 'string')]
> """
> {code}
> I have a pending PR for fixing many of these occurrences here:
> https://github.com/apache/spark/pull/15053
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]