Jeff,

The 'KMeans' job in SyntheticControl does not run KMeans. Presumably the idea is to run Canopy (which it does) and then KMeans (which it doesn't).
Am I missing something?

--benson

On Fri, May 29, 2009 at 10:29 AM, Jeff Eastman <[email protected]> wrote:

> Benson Margulies wrote:
>
>> OK, I've got some inputs, I want to run k-means, how do I feed the beast?
>
> Make sure you can run the Synthetic Control example to get everything wired
> together correctly: JDK, Hadoop, Mahout. See
> http://cwiki.apache.org/MAHOUT/syntheticcontroldata.html. Then write an
> input job to convert your data similar to
> /Mahout/examples/src/main/java/org/apache/mahout/clustering/syntheticcontrol/canopy/InputDriver.java
> and make a new job like
> /Mahout/examples/src/main/java/org/apache/mahout/clustering/syntheticcontrol/kmeans/Job.java.
> You will have a small adventure and then be operational.
>
> Have fun,
> Jeff
