Distance Measure in K-Means Java

2014-04-13 Thread Nicklas Nilsson
Hi, I am trying to cluster text with Canopy and K-Means. This is what I have and it works. But I’m curios if I should not somehow run K-Means with Tanimoto and Canopy with Euclidian instead? What is K-Means using in my setup? And why have the parameter for distance measure in KMeansDrivers run

Documentation, Documentation, Documentation

2014-04-13 Thread Sebastian Schelter
Hi, this is another reminder that we still have to finish our documentation improvements! The website looks shiny now and there have been lots of discussions about new directions but we still have some work todo in cleaning up webpages. We should especially make sure that the examples work.

Re: lucene2seq error: field does not exist in the index

2014-04-13 Thread Suneel Marthi
Apologies for the delayed response Terry.  Mahout's presently at Lucene 4.6.1 (both 0.9 and trunk).  The practice so far has been to upgrade to the latest Lucene version right before a planned release. Not sure what has changed in Solr/Lucene 4.7.1. You could try either of 2 things:- a) Is