RE: Kmeans runs successfully, but no map/reduce jobs

Dave Gettier Thu, 28 Jul 2011 06:50:23 -0700

Can you elaborate on that? Not sure what "--method mapreduce" means.


Some details:

- I have a CSV file. 
- I programmatically create  namedVectors (a java program).
- I run the kmeansdriver which (I thought) submits a number of mapreduce jobs. 
- The output is another sequence file, which I then turn back into CSV.
- All sequence files are stored locally. 


Side note: 
        I am overriding the distance method. 
        I can which distance method via a config file. 
        I can run Euclidian, or my distance function.

-----Original Message-----
From: Grant Ingersoll [mailto:[email protected]] 
Sent: Thursday, July 28, 2011 9:23 AM
To: [email protected]
Subject: Re: Kmeans runs successfully, but no map/reduce jobs

Do you need --method mapreduce passed in?

On Jul 27, 2011, at 4:20 PM, Dave Gettier wrote:

> 
> I am running a kmeans application which was adapted from  example 7.2 of 
> Mahout in Action.  The java program runs successfully,  giving me the 
> expected results;  however, there are no map/reduce jobs being kicked off.    
> My understanding was that KMeansCluster runs locally, but KMeansDriver run on 
> the cluster.  How does one point the job to run on the cluster? Or am I 
> missing something?
> 
> KMeansDriver.run(conf,
>               new Path(cp.getsDataDir() +  "/points"),
>               new Path(cp.getsDataDir()  + "/clusters"),
>               new Path(cp.getsDataDir()  +  "/outputs"),
>               new EuclideanDistanceMeasure(),
>               .001,10, true, true);
> 
> Thanks in advance!
> 
> -DG
> 

--------------------------------------------
Grant Ingersoll

RE: Kmeans runs successfully, but no map/reduce jobs

Reply via email to