KMeansPlusPlusClusterer incorrectly selects initial cluster centers and is
unnecessarily slow
---------------------------------------------------------------------------------------------
Key: MATH-584
URL: https://issues.apache.org/jira/browse/MATH-584
Project: Commons Math
Issue Type: Bug
Affects Versions: 2.2
Environment: All environments
Reporter: Randall Scarberry
Fix For: 2.2.1
The chooseInitialClusters() method declares sum as an int, when it should be
double. It also is quite slow because it contains a lot of unnecessary
computation. I'll attached a patch which corrects the problems.
I found the problems while comparing an optimized implementation of KMeans++
I've been working on with the one in commons math.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira