I used the InputDriver that Jeff placed in Utils to convert my input to a SeqFile and ran it through mahout kmeans, now I can specify the 'k' arg. Jeff - I know you tried to tell me, it just didn't sink in until now. :-)
On Fri, Oct 1, 2010 at 7:22 AM, Matt Tanquary <[email protected]> wrote: > I played around with the t1 and t2 until I got a k that I expected > with my small set, but if I want to ensure say 3 clusters on a large > set of data, then how to I use t1 and t2 to set k? Is there a formula > for that? > > On Thu, Sep 30, 2010 at 8:24 PM, Lahiru Samarakoon <[email protected]> wrote: >> Hi Matt, >> >> As Jeff has mentioned earlier, you have to choose t1 and t2 to get the k >> when you are using * syntheticcontrol.kmeans.Job* program. So what you have >> experienced is correct. >> >> Thanks, >> Lahiru >> > > > > -- > Have you thanked a teacher today? ---> http://www.liftateacher.org > -- Have you thanked a teacher today? ---> http://www.liftateacher.org
