Hi Paul, if you can't wait, a vanilla implementation is already contained as part of the Flink examples. You should find it under flink/flink-examples.
But we will try to add more clustering algorithms in the near future. Cheers, Till On Apr 26, 2015 11:14 PM, "Alexander Alexandrov" < alexander.s.alexand...@gmail.com> wrote: > Yes, I expect to have one in the next few weeks (the code is actually > there, but we need to port it to the Flink ML API). I suggest to follow the > JIRA issue in the next weeks to check when this is done: > > https://issues.apache.org/jira/browse/FLINK-1731 > > Regards, > Alexander > > PS. Bear in mind that we will start with a vanilla implementation of > K-Means. For a thorough evaluation you might want to also check variants > like K-Means++. > > > 2015-04-24 15:08 GMT+02:00 Pa Rö <paul.roewer1...@googlemail.com>: > >> hi flink community, >> >> at the time I write my master thesis in the field machine learning. My >> main task is to evaluated different k-means variants for large data sets >> (BigData). I would like test flink ml against Apache Mahout and Apache >> Hadoop MapReduce in areas of scalability and performance(time and space). >> What is the current state for the purpose of clustering, especially >> K-Means? Will there be in the near future a release information this? >> >> best greetings >> paul >> > >