Re: boost selected dimensions in kmeans clustering

2015-01-15 Thread Ted Dunning
On Thu, Jan 15, 2015 at 5:23 AM, Miguel Angel Martin junquera mianmarjun.mailingl...@gmail.com wrote: My question is:.. Is it better to scale up these dimensions directly in the tf-idf sequence final mix file using this correction factors OR first do scale up in each tf-vectors and

Re: boost selected dimensions in kmeans clustering

2015-01-15 Thread Miguel Angel Martin junquera
hi Ted, Yes. I was considering various possibilities. one of them was this. ( scale up these dimensions, for example,multiplying by a configurable factor correction.) I really want to mix two different vectors from the same documents with different lengths and dictionaries , (perhaps some

Re: boost selected dimensions in kmeans clustering

2015-01-14 Thread Ted Dunning
The easiest way is to scale those dimensions up. On Wed, Jan 14, 2015 at 2:41 AM, Miguel Angel Martin junquera mianmarjun.mailingl...@gmail.com wrote: hi all, I am clustering using kmeans several text documents from distintct sources and I have generated the sparse vectors of each