Look at ClusterIterator.iterate(). This will do clustering in memory without any Hadoop. ClusterIterator.iterateSeq will do clustering in a single process from/to Hadoop sequence files but without map/reduce. ClusterIterator.iterateMR uses full Hadoop to do clustering for the same algorithms (k-means, fuzzy-k, Dirichlet), all configured using ClusteringPolicy instances.

On 5/14/12 8:34 AM, Ioan Eugen Stan wrote:
Hi,

Dos mahout offer online clustering out of the box using sequential
clustering (no MapReduce). I'm looking over the code (trunk) and I
found ClusterClassifier but I can't figure out how that works. Any
examples or more docs on this topic?

Thanks,

Reply via email to