Hi Oscar,
It would help a lot if you could provide a bit more information on the
data that you wish to cluster, particularly the dimensionality of each
record and the number of records. Also please note that Mahout's k-means
implementation runs in a batch mode on Hadoop so integrating this with
your hibernate data set would require you to dump the SQL data into
Hadoop sequence files in order cluster it. The result sequence files
would then need to be loaded back into hibernate. Depending how your web
application uses the clustered data, this may not meet your performance
requirements.
If not, you may want to look at Weka
(http://www.cs.waikato.ac.nz/ml/weka) instead, also Ted Dunning has
recently posted links to a single pass k-means implementation that you
might be able to integrate into your web application.
On 4/16/12 1:27 AM, OSCAR wrote:
Hello
My name is Oscar González, i'm studing System Engeineer in the universidad el
Bosque from Colombia. And i have the next question:
I have a web aplication with hibernate. And i need use clustering, kmeans
algorithm. I wanna use mahout, but I don't know, how can I apply mahout in my
project... I'm using netbeans. Please answer me,
Thanks
Oscar Miguel Gonzalez
Enviado desde mi iPad