chentao106 opened a new pull request #117: Implement the MiniBatchKMeansClusterer
URL: https://github.com/apache/commons-math/pull/117

Implement the MiniBatchKMeansClusterer, with a unit test that compares it to KMeansPlusPlusClusterer.

MiniBatchKMeans is a fast clustering algorithm based on KMeans (see Python's sklearn.cluster.MiniBatchKMeans). It uses a subset of the points to initialize the cluster centers and a mini-batch of points in each iteration. It can cluster millions of data points in a few seconds, and its results differ little from those of KMeans.

See https://www.eecs.tufts.edu/~dsculley/papers/fastkmeans.pdf
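
For readers unfamiliar with the technique, here is a minimal sketch of the mini-batch update described in the linked paper, under stated assumptions. It is not the code in this pull request: the class name MiniBatchKMeansSketch and the methods fit and nearest are hypothetical, it does not use any Commons Math types, and it initializes the centers from randomly chosen points rather than from a k-means++ style pass over a sample.

```java
import java.util.Random;

/** Hypothetical sketch of mini-batch k-means; not the implementation in this PR. */
final class MiniBatchKMeansSketch {

    /** Fits k centers to the points using mini-batches drawn with replacement. */
    static double[][] fit(double[][] points, int k, int batchSize,
                          int iterations, long seed) {
        Random rnd = new Random(seed);
        int dim = points[0].length;

        // Assumption: start from k randomly chosen points (a simplification;
        // duplicates are possible, so a real implementation would do better).
        double[][] centers = new double[k][];
        for (int j = 0; j < k; j++) {
            centers[j] = points[rnd.nextInt(points.length)].clone();
        }

        int[] counts = new int[k];            // updates applied to each center
        int[] assigned = new int[batchSize];  // cached nearest center per batch point
        double[][] batch = new double[batchSize][];

        for (int it = 0; it < iterations; it++) {
            // Sample a mini-batch and cache each point's nearest center.
            for (int b = 0; b < batchSize; b++) {
                batch[b] = points[rnd.nextInt(points.length)];
                assigned[b] = nearest(centers, batch[b]);
            }
            // Move each assigned center toward its point with a per-center
            // learning rate of 1 / (number of updates so far).
            for (int b = 0; b < batchSize; b++) {
                int j = assigned[b];
                counts[j]++;
                double eta = 1.0 / counts[j];
                for (int d = 0; d < dim; d++) {
                    centers[j][d] = (1.0 - eta) * centers[j][d] + eta * batch[b][d];
                }
            }
        }
        return centers;
    }

    /** Returns the index of the center closest to p (squared Euclidean distance). */
    static int nearest(double[][] centers, double[] p) {
        int best = 0;
        double bestDist = Double.POSITIVE_INFINITY;
        for (int j = 0; j < centers.length; j++) {
            double dist = 0.0;
            for (int d = 0; d < p.length; d++) {
                double diff = centers[j][d] - p[d];
                dist += diff * diff;
            }
            if (dist < bestDist) {
                bestDist = dist;
                best = j;
            }
        }
        return best;
    }
}
```

Each iteration touches only batchSize points instead of the whole data set, which is where the speed-up over standard KMeans comes from. The decaying learning rate makes each center a running mean of the points assigned to it.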
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]

With regards,
Apache Git Services
