Hello, I'm new to Mahout but have been working with Solr and Carrot2, clustering documents with the Lingo algorithm. This has worked well for us for clustering small sets of search results, but we now want to cluster much larger sets of documents (millions to tens of millions of documents for now).
Could someone point me in the right direction as to which of the clustering algorithms I should take a look at first (ideally one similar to Lingo)?

Thanks,
Mike
