Hi all, I am a new to Mahout and have a usecase. Wanted to know what is the best possible approach to achieve this in Mahout. Usecase I will be having many documents with different paragraphs. The paragraphs are first need to classify as Relevant/Irrelevant. Then further the Relevant paragraphs are classified in different labels. How to achieve this in Mahout ?
Query regarding Naïve Bayes I have executed 20 NewsGroup example in Mahout which was a success. I realized that in this example no separate labels are defined and hence the code classify with the directory name of the documents. I wanted to know how to use custom labels in Naïve Bayes algo. I tried with "-l" option which created "labels" file in HDFS but after then the job failed. What is the critera that Naïve Bayes check to apply the labels to the documents. Some algorithm functioning insights will help. Thanks Stuti Awasthi ::DISCLAIMER:: ---------------------------------------------------------------------------------------------------------------------------------------------------- The contents of this e-mail and any attachment(s) are confidential and intended for the named recipient(s) only. E-mail transmission is not guaranteed to be secure or error-free as information could be intercepted, corrupted, lost, destroyed, arrive late or incomplete, or may contain viruses in transmission. The e mail and its contents (with or without referred errors) shall therefore not attach any liability on the originator or HCL or its affiliates. Views or opinions, if any, presented in this email are solely those of the author and may not necessarily reflect the views or opinions of HCL or its affiliates. Any form of reproduction, dissemination, copying, disclosure, modification, distribution and / or publication of this message without the prior written consent of authorized representative of HCL is strictly prohibited. If you have received this email in error please delete it and notify the sender immediately. Before opening any email and/or attachments, please check them for viruses and other defects. ----------------------------------------------------------------------------------------------------------------------------------------------------
