[ https://issues.apache.org/jira/browse/SPARK-5026?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Sebastián Ramírez closed SPARK-5026.
------------------------------------

You are right. It's OK now. Thanks.

> PySpark rdd.randomSpit() is not documented
> ------------------------------------------
>
>                 Key: SPARK-5026
>                 URL: https://issues.apache.org/jira/browse/SPARK-5026
>             Project: Spark
>          Issue Type: Documentation
>          Components: Documentation, PySpark
>   Affects Versions: 1.2.0
>            Reporter: Sebastián Ramírez
>            Priority: Minor
>             Fix For: 1.2.1
>
>
> In the current latest version of Spark (1.2.0) If you go to the Python API,
> in the RDD section, there is no documentation for rdd.randomSplit():
> http://spark.apache.org/docs/latest/api/python/pyspark.html#pyspark.RDD
> Nevertheless, it is used as an example in the 1.2.0 documentation for mllib:
> http://spark.apache.org/docs/latest/mllib-ensembles.html#regression
> (It's in the Python code tab, you can *Ctrl+F* and search for "randomSplit").
> But looking in the code, it seems implemented:
> https://github.com/apache/spark/blob/branch-1.2/python/pyspark/rdd.py#L322

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
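
For readers who land on this issue looking for what rdd.randomSplit() does: it takes a list of weights and an optional seed, and returns one RDD per weight. The pure-Python sketch below only illustrates that behaviour on a plain list. It is not Spark's code, and the helper name random_split is hypothetical. It assumes weights are normalized to sum to 1 and that each element lands in exactly one split.

```python
import random
from bisect import bisect_right
from itertools import accumulate


def random_split(items, weights, seed=None):
    """Hypothetical stand-in that mimics the semantics of rdd.randomSplit()
    on an ordinary iterable. Illustration only, not PySpark's implementation."""
    total = float(sum(weights))
    # Cumulative upper bounds of each split's share of [0, 1),
    # e.g. weights [0.7, 0.3] give bounds [0.7, 1.0].
    bounds = list(accumulate(w / total for w in weights))
    rng = random.Random(seed)
    splits = [[] for _ in weights]
    for item in items:
        u = rng.random()
        # Pick the split whose range contains u; clamp to guard against
        # floating-point rounding in the last bound.
        idx = min(bisect_right(bounds, u), len(weights) - 1)
        splits[idx].append(item)
    return splits


train, test = random_split(range(1000), [0.7, 0.3], seed=17)
print(len(train), len(test))
```

With a real RDD, the corresponding call would be rdd.randomSplit([0.7, 0.3], seed), as in the mllib example linked in the issue. The split sizes are approximate, not exact, because each element is assigned independently.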