At first I also thought that using KMeansClusterer and ClusterDumper
would help in getting all the vectors belonging to a cluster, but it
did not help me much.

I used KMeansDriver instead, which I think is easy enough to use.

After execution, the records are written in the form
<cluster id><vector>:

    context.write(new Text(cluster.getIdentifier()), cluster);

So what helped me was to process this output into a map with the cluster
id as the key and a list of vectors as the value. I read the clustered
points and put all the data into the map in that form. In the end, the
list stored against each cluster id was what I needed.
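
The grouping step described above can be sketched roughly as follows. This is only a minimal illustration in plain Java, not Mahout code: the Record class, the groupByCluster method and the cluster ids "c0" and "c1" are hypothetical stand-ins, and reading the records from the actual clustering output is left out.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

class ClusterGrouping {

    /** Hypothetical stand-in for one output record: a cluster id plus one vector. */
    static final class Record {
        final String clusterId;
        final double[] vector;

        Record(String clusterId, double[] vector) {
            this.clusterId = clusterId;
            this.vector = vector;
        }
    }

    /** Builds a map with the cluster id as key and the list of its vectors as value. */
    static Map<String, List<double[]>> groupByCluster(Iterable<Record> records) {
        // LinkedHashMap keeps cluster ids in the order they are first seen.
        Map<String, List<double[]>> byCluster = new LinkedHashMap<>();
        for (Record r : records) {
            // Create the list on the first record for a cluster id, then append.
            byCluster.computeIfAbsent(r.clusterId, k -> new ArrayList<>()).add(r.vector);
        }
        return byCluster;
    }

    public static void main(String[] args) {
        // Made-up sample data; real records would come from the clustering output.
        List<Record> records = Arrays.asList(
                new Record("c0", new double[] {1.0, 1.0}),
                new Record("c1", new double[] {9.0, 8.0}),
                new Record("c0", new double[] {1.5, 0.5}));

        for (Map.Entry<String, List<double[]>> e : groupByCluster(records).entrySet()) {
            System.out.println(e.getKey() + " has " + e.getValue().size() + " vector(s)");
        }
    }
}
```

After the loop, looking up a cluster id in the map gives the list of all vectors assigned to that cluster.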

Hope this helps.

Regards,
Paritosh

On 03-11-2011 14:23, WangRamon wrote:
> Hi all,
>
> I'm using KMeansClusterer. I will use KMeansDriver in a Hadoop
> environment later, but I think it will be easier to understand by using
> KMeansClusterer first. My question is: I cannot find a way to tell which
> cluster a point belongs to after running KMeansClusterer. I expected the
> Cluster interface to have some API for getting all the points/vectors
> that belong to a cluster, but... did I miss something? Thanks a lot.
>
> Cheers,
> Ramon
