Hi.
I am trying to run clustering using Mahout 0.7.
I am clustering short text documents.
The general framework is first running Canopy clustering,
and later running kmeans using the Canopy centroids as a starting point.
This gave useful result in Mahout 0.6.
The canopy part works fine,
but kmeans in Mahout 0.7 seems to keep the same clusters and not to modify
them between different iterations -
I get the same file names with the same sizes, and the process does not
seem to converge.
It looks to me like a bug, but it might be the planned behavior.
TIA,
Yuval

Reply via email to