Look at ClusterIterator.iterate(). This will do clustering in memory
without any Hadoop. ClusterIterator.iterateSeq will do clustering in a
single process from/to Hadoop sequence files but without map/reduce.
ClusterIterator.iterateMR uses full Hadoop to do clustering for the same
algorithms (k-means, fuzzy-k, Dirichlet), all configured using
ClusteringPolicy instances.
On 5/14/12 8:34 AM, Ioan Eugen Stan wrote:
Hi,
Dos mahout offer online clustering out of the box using sequential
clustering (no MapReduce). I'm looking over the code (trunk) and I
found ClusterClassifier but I can't figure out how that works. Any
examples or more docs on this topic?
Thanks,