If you have a small number of documents, check out http://search.carrot2.org/
If you have a large number of documents, check the new Mahout streaming k-means clustering. If you have a medium number of documents, check the older Mahout clustering algorithms. These have the advantage of consistent API and command line interfaces. On Mon, Feb 11, 2013 at 6:57 AM, vivek bairathi <[email protected]>wrote: > Hi All, > > I want to use a clustering algorithm for clustering of Documents and its > content. So just want to know which will be the best clustering algorithm > for this as there are many clustering algorithms available and I am > confused which one to use. > > Please help. > > > Thanks in advance. > > -- > Regards, > Vivek Bairathi >
