RE: Kmeans runs successfully, but no map/reduce jobs

Dave Gettier Thu, 28 Jul 2011 04:34:58 -0700

I assume that since it runs successfully, the input Vectors should be fine. The 
input has "n" number of clusters; the output shows these correctly.

Note:
The created sequence files are stored locally, not on HDFS. When I ran the 
Seinfeld Episodes Mahout clustering example (on the web if you are not familiar 
with it), it "required" the data to be on HDFS to work. Not sure if this point 
is relevant. I am just not sure how to specify HDFS as the output target for 
the PATHs, as opposed to the file system.

Thanks

-----Original Message-----
From: Robin Anil [mailto:[email protected]] 
Sent: Thursday, July 28, 2011 2:25 AM
To: [email protected]
Subject: Re: Kmeans runs successfully, but no map/reduce jobs

Have you verified that the Sequence file in the input folder is having valid
records?

Robin

On Wed, Jul 27, 2011 at 4:20 PM, Dave Gettier <[email protected]>wrote:

>
> I am running a kmeans application which was adapted from  example 7.2 of
> Mahout in Action.  The java program runs successfully,  giving me the
> expected results;  however, there are no map/reduce jobs being kicked off.
>  My understanding was that KMeansCluster runs locally, but KMeansDriver run
> on the cluster.  How does one point the job to run on the cluster? Or am I
> missing something?
>
> KMeansDriver.run(conf,
>               new Path(cp.getsDataDir() +  "/points"),
>               new Path(cp.getsDataDir()  + "/clusters"),
>               new Path(cp.getsDataDir()  +  "/outputs"),
>               new EuclideanDistanceMeasure(),
>               .001,10, true, true);
>
> Thanks in advance!
>
> -DG
>
>

RE: Kmeans runs successfully, but no map/reduce jobs

Reply via email to