Jeff, Could you tell me about what's failing in KMeans and LDA when running on a cluster? I had this working just prior to 0.5 in https://issues.apache.org/jira/browse/MAHOUT-694
Thanks, Drew On Thu, Jun 9, 2011 at 2:01 PM, Jeff Eastman <[email protected]> wrote: > Ahem, KMeans is not busted. It is being maintained by me, at least. The > build-reuters.sh script runs only in local mode on 0.5 and fails in both > KMeans and LDA when run on a cluster. The MIA examples are not always > correct. Most of this has been reported before. > > -----Original Message----- > From: Sean Owen [mailto:[email protected]] > Sent: Thursday, June 09, 2011 12:29 AM > To: [email protected] > Subject: Re: Problems running examples > > (Assuming you are on HEAD,) I think KMeans is busted -- this has come up > before. I don't know if it is being maintained. Anyone who's willing to > step up and fix it is also welcome to overhaul it IMHO. > > On Thu, Jun 9, 2011 at 12:03 AM, Hector Yee <[email protected]> wrote: > >> I got a slightly different error on the next line of KMeansDriver.java >> (running on OS X Snow Leopard) >> >> 11/06/08 16:02:12 INFO compress.CodecPool: Got brand-new compressor >> Exception in thread "main" java.lang.ClassCastException: >> org.apache.hadoop.io.IntWritable cannot be cast to >> org.apache.mahout.math.VectorWritable >> at >> >> org.apache.mahout.clustering.kmeans.RandomSeedGenerator.buildRandom(RandomSeedGenerator.java:90) >> at >> org.apache.mahout.clustering.kmeans.KMeansDriver.run(KMeansDriver.java:102) >> >> >> On Sun, Jun 5, 2011 at 9:31 PM, Jeff Eastman <[email protected]> wrote: >> >> > IIRC, Reuters used to run on a cluster but no longer does due to some >> > obscure Lucene changes. In 0.5 it only works in local mode. I really hope >> > this can be repaired by 0.6 as Reuters is a key entry point into Mahout >> > clustering for many users. >> > >> >
