Hi Oscar,

It would help a lot if you could provide a bit more information on the data that you wish to cluster, particularly the dimensionality of each record and the number of records. Also please note that Mahout's k-means implementation runs in a batch mode on Hadoop so integrating this with your hibernate data set would require you to dump the SQL data into Hadoop sequence files in order cluster it. The result sequence files would then need to be loaded back into hibernate. Depending how your web application uses the clustered data, this may not meet your performance requirements.

If not, you may want to look at Weka (http://www.cs.waikato.ac.nz/ml/weka) instead, also Ted Dunning has recently posted links to a single pass k-means implementation that you might be able to integrate into your web application.


On 4/16/12 1:27 AM, OSCAR wrote:
Hello

My name is Oscar González, i'm studing System Engeineer in the universidad el 
Bosque from Colombia. And i have the next question:

I have a web aplication with hibernate. And i need use clustering, kmeans 
algorithm. I wanna use mahout, but I don't know, how can I apply mahout in my 
project... I'm using netbeans. Please answer me,

Thanks

Oscar Miguel Gonzalez

Enviado desde mi iPad



Reply via email to