Hi,

I have a problem that i would like to implement in mahout clustering.

I have input text documents with data like below.

Document1: This is the first document of selling information.
Document2: This is the second document of gathering information.

I also have another look up file with data like below
selling:CatA
gathering:CatB.
information:CatC

NOw i would like to cluster the documents with output being genrated as
Document1:CatA,CatC
Document2:CatB,CatC

Please let me know how to achieve this.

Thanks,
Subbu

Reply via email to