KMeansPlusPlusClusterer incorrectly selects initial cluster centers and is 
unnecessarily slow
---------------------------------------------------------------------------------------------

                 Key: MATH-584
                 URL: https://issues.apache.org/jira/browse/MATH-584
             Project: Commons Math
          Issue Type: Bug
    Affects Versions: 2.2
         Environment: All environments
            Reporter: Randall Scarberry
             Fix For: 2.2.1


The chooseInitialClusters() method declares sum as an int when it should be 
a double.  It is also quite slow because it performs a lot of unnecessary 
computation.  I'll attach a patch which corrects both problems.
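
To illustrate why the int declaration matters, here is a minimal sketch. It is 
not the Commons Math source and not the attached patch. It assumes the running 
total is built with a compound assignment (sum += value), which in Java 
silently narrows the result to int. The class name, method names, and sample 
values are hypothetical.

```java
// Hypothetical sketch, not the Commons Math implementation.
final class SeedingSketch {

    /** Assumed buggy pattern: an int accumulator drops the fractional part on every step. */
    static int truncatedSum(double[] squaredDistances) {
        int sum = 0;
        for (double d2 : squaredDistances) {
            sum += d2; // compound assignment applies an implicit (int) cast
        }
        return sum;
    }

    /** Same loop with a double accumulator, so the total is preserved. */
    static double exactSum(double[] squaredDistances) {
        double sum = 0.0;
        for (double d2 : squaredDistances) {
            sum += d2;
        }
        return sum;
    }

    /**
     * Picks an index with probability proportional to its squared distance.
     * The parameter u is a uniform random number in [0, 1).
     */
    static int chooseIndex(double[] squaredDistances, double u) {
        double target = u * exactSum(squaredDistances);
        double cumulative = 0.0;
        for (int i = 0; i < squaredDistances.length; i++) {
            cumulative += squaredDistances[i];
            if (cumulative > target) { // strict comparison skips zero-weight points
                return i;
            }
        }
        return squaredDistances.length - 1; // guard against floating-point rounding
    }
}
```

With squared distances that are all below 1, the int accumulator stays at 0, 
so a threshold computed as random * sum is always 0 and the draw is no longer 
proportional to the squared distances. The double accumulator keeps the 
weights intact.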

I found the problems while comparing an optimized implementation of KMeans++ 
that I've been working on with the one in Commons Math.



--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira
