Hi All, I am new to Machine learning and wanted to know more about Mahout in general and how we can apply these algortithms to our applications.
I wanted to try out this example: Techcrunch has the company database and also information about what that company does. I was thinking if we can use Mahout's Classifying algorithms which could take these info pages and classify them companies into different categories.. One more thing would be to look at their job description and find out what technologies they are using and classify them. What would be the steps required to get this done.. I tried out Twenty Newsgroups<https://cwiki.apache.org/confluence/display/MAHOUT/Twenty+Newsgroups>example in which case we need to train it. I assume we need to do something like that for the problem described above. Please let me know. Thanks, Shrikar
