Check out the streaming k-means code. It provides capabilities for weighted samples.
On Sat, Aug 10, 2013 at 6:57 AM, William Moran <[email protected]> wrote: > Hi, > > How would I go about changing the weighting of certain words when preparing > data for kmeans? > > Also, in clusterdumps I have already made, some of my clusters are marked > 'VL-' and some are 'CL-'. I believe this is to do with convergence, is it > bad if the clusters have not converged and if so how can I ensure they do > converge? > > Thanks > > (P.S. I did send a question similar to this a while ago but I'm not sure it > worked) >
