If you have a small number of documents, check out
http://search.carrot2.org/

If you have a large number of documents, check the new Mahout streaming
k-means clustering.

If you have a medium number of documents, check the older Mahout clustering
algorithms.  These have the advantage of consistent API and command line
interfaces.

On Mon, Feb 11, 2013 at 6:57 AM, vivek bairathi <[email protected]>wrote:

> Hi All,
>
> I want to use a clustering algorithm for clustering of Documents and its
> content. So just want to know which will be the best clustering algorithm
> for this as there are many clustering algorithms available and I am
> confused which one to use.
>
> Please help.
>
>
> Thanks in advance.
>
> --
> Regards,
> Vivek Bairathi
>

Reply via email to