[ 
https://issues.apache.org/jira/browse/SPARK-6258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14386394#comment-14386394
 ] 

Hrishikesh commented on SPARK-6258:
-----------------------------------

Hi [~josephkb]
I am a newbie to spark and I would like to contribute. Could you assign this 
ticket to me?

> Python MLlib API missing items: Clustering
> ------------------------------------------
>
>                 Key: SPARK-6258
>                 URL: https://issues.apache.org/jira/browse/SPARK-6258
>             Project: Spark
>          Issue Type: Sub-task
>          Components: MLlib, PySpark
>    Affects Versions: 1.3.0
>            Reporter: Joseph K. Bradley
>
> This JIRA lists items missing in the Python API for this sub-package of MLlib.
> This list may be incomplete, so please check again when sending a PR to add 
> these features to the Python API.
> Also, please check for major disparities between documentation; some parts of 
> the Python API are less well-documented than their Scala counterparts.  Some 
> items may be listed in the umbrella JIRA linked to this task.
> KMeans
> * setEpsilon
> * setInitializationSteps
> KMeansModel
> * computeCost
> * k
> GaussianMixture
> * setInitialModel
> GaussianMixtureModel
> * k
> Completely missing items which should be fixed in separate JIRAs (which have 
> been created and linked to the umbrella JIRA)
> * LDA
> * PowerIterationClustering
> * StreamingKMeans



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to