[
https://issues.apache.org/jira/browse/SPARK-5026?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastián Ramírez closed SPARK-5026.
------------------------------------
You are right. It's OK now. Thanks.
> PySpark rdd.randomSpit() is not documented
> ------------------------------------------
>
> Key: SPARK-5026
> URL: https://issues.apache.org/jira/browse/SPARK-5026
> Project: Spark
> Issue Type: Documentation
> Components: Documentation, PySpark
> Affects Versions: 1.2.0
> Reporter: Sebastián Ramírez
> Priority: Minor
> Fix For: 1.2.1
>
>
> In the current latest version of Spark (1.2.0) If you go to the Python API,
> in the RDD section, there is no documentation for rdd.randomSplit():
> http://spark.apache.org/docs/latest/api/python/pyspark.html#pyspark.RDD
> Nevertheless, it is used as an example in the 1.2.0 documentation for mllib:
> http://spark.apache.org/docs/latest/mllib-ensembles.html#regression
> (It's in the Python code tab, you can *Ctrl+F* and search for "randomSplit").
> But looking in the code, it seems implemented:
> https://github.com/apache/spark/blob/branch-1.2/python/pyspark/rdd.py#L322
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]