holdenk created SPARK-13226:
-------------------------------

             Summary: MLlib PowerIteration Clustering depends on deprecated KMeans setRuns API
                 Key: SPARK-13226
                 URL: https://issues.apache.org/jira/browse/SPARK-13226
             Project: Spark
          Issue Type: Improvement
          Components: MLlib
            Reporter: holdenk
            Priority: Trivial


The current MLlib PowerIteration clustering implementation hard-codes the 
number of runs in its k-means call to 5, an apparently arbitrary value, through 
the deprecated KMeans setRuns API. This should likely be replaced with an 
explicit convergence tolerance.

The reference implementation ( http://www.cs.cmu.edu/~wcohen/ ) also appears 
to use a tolerance, so this change would bring MLlib closer to it.
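
To illustrate what a tolerance-based stopping rule means here, the following is 
a minimal standalone sketch in plain Python, not Spark code. It runs Lloyd's 
k-means on scalar values and stops once no center moves by more than epsilon. 
The function name kmeans_1d, the default epsilon of 1e-4, the iteration cap, 
and the sample data are all assumptions made for this example. In MLlib the 
comparable knob is KMeans.setEpsilon; whether the fix should use it, and with 
which value, is left open.

```python
def kmeans_1d(points, centers, epsilon=1e-4, max_iterations=100):
    """Lloyd's algorithm on scalars; stops once no center moves more than epsilon."""
    centers = list(centers)
    iteration = 0
    for iteration in range(1, max_iterations + 1):
        # Assignment step: group each point with its nearest center.
        clusters = [[] for _ in centers]
        for p in points:
            nearest = min(range(len(centers)), key=lambda i: abs(p - centers[i]))
            clusters[nearest].append(p)
        # Update step: move each center to its cluster mean (unchanged if empty).
        updated = [sum(c) / len(c) if c else centers[i]
                   for i, c in enumerate(clusters)]
        # Convergence check: largest distance any center moved in this pass.
        shift = max(abs(a - b) for a, b in zip(updated, centers))
        centers = updated
        if shift <= epsilon:
            break
    return centers, iteration


# Hypothetical sample data: two well-separated groups of scalar values.
points = [0.0, 0.1, 0.2, 0.9, 1.0, 1.1]
centers, iterations = kmeans_1d(points, centers=[0.0, 1.0])
print(centers, iterations)
```

With a tolerance, the amount of work adapts to the data: the loop ends as soon 
as the centers stabilize, instead of repeating a fixed number of runs.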



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

