Re: k-Means questions

Grant Ingersoll Thu, 25 Jun 2009 19:11:55 -0700


On Jun 25, 2009, at 7:00 PM, Ted Dunning wrote:

On Thu, Jun 25, 2009 at 3:49 PM, Grant Ingersoll<[email protected]>wrote:
Do people have recommendations for start clusters (seeds) for k-Means. Thesynthetic control example uses Canopy and I often see Randomselectionmentioned, but I'm wondering what's considered to be best practicesfor
obtaining good overall results.
Just picking a random data element for each centroid should work well.
Random assignment works much less well because all of the centroidsget put
very close to the mean of the entire data set.

I'm confused by these two sentences. They seem contradictory, but I'msure the error is on my end.


-Grant

Re: k-Means questions

Reply via email to