I assume that since it runs successfully, the input Vectors should be fine. The input has "n" number of clusters; the output shows these correctly.
Note: The created sequence files are stored locally, not on HDFS. When I ran the Seinfeld Episodes Mahout clustering example (on the web if you are not familiar with it), it "required" the data to be on HDFS to work. Not sure if this point is relevant. I am just not sure how to specify HDFS as the output target for the PATHs, as opposed to the file system.

Thanks

-----Original Message-----
From: Robin Anil [mailto:[email protected]]
Sent: Thursday, July 28, 2011 2:25 AM
To: [email protected]
Subject: Re: Kmeans runs successfully, but no map/reduce jobs

Have you verified that the Sequence file in the input folder is having valid records?

Robin

On Wed, Jul 27, 2011 at 4:20 PM, Dave Gettier <[email protected]> wrote:

> I am running a kmeans application which was adapted from example 7.2 of
> Mahout in Action. The java program runs successfully, giving me the
> expected results; however, there are no map/reduce jobs being kicked off.
> My understanding was that KMeansCluster runs locally, but KMeansDriver run
> on the cluster. How does one point the job to run on the cluster? Or am I
> missing something?
>
> KMeansDriver.run(conf,
>     new Path(cp.getsDataDir() + "/points"),
>     new Path(cp.getsDataDir() + "/clusters"),
>     new Path(cp.getsDataDir() + "/outputs"),
>     new EuclideanDistanceMeasure(),
>     .001, 10, true, true);
>
> Thanks in advance!
>
> -DG
