Hi, I have a problem that i would like to implement in mahout clustering.
I have input text documents with data like below. Document1: This is the first document of selling information. Document2: This is the second document of gathering information. I also have another look up file with data like below selling:CatA gathering:CatB. information:CatC NOw i would like to cluster the documents with output being genrated as Document1:CatA,CatC Document2:CatB,CatC Please let me know how to achieve this. Thanks, Subbu
