Re: Fwd: PCA with ssvd leads to StackOverFlowError

2014-03-07 Thread Kevin Moulart
Perfect! It works like a charm now! I'll keep testing after lunch and let you know if any new problems come up, but it looks promising! Thank you very much! Kévin Moulart 2014-03-06 19:31 GMT+01:00 Ted Dunning ted.dunn...@gmail.com: On Thu, Mar 6, 2014 at 7:46 AM, Kevin Moulart

mahout command

2014-03-07 Thread Mahmood Naderan
Hi, when I run mahout wikipediaXMLSplitter -d examples/temp/enwiki-latest-pages-articles.xml -o wikipedia/chunks -c 64 I get this error: 14/03/07 16:24:13 WARN driver.MahoutDriver: Unable to add class: wikipediaXMLSplitter java.lang.ClassNotFoundException: wikipediaXMLSplitter     at

Re: mahout command

2014-03-07 Thread Suneel Marthi
Mahmood, wikipediaXMLSplitter is not present in driver.classes.default.props. To accomplish what you are trying to do, you can edit src/conf/driver.classes/default/props and add an entry for wikipediaXMLSplitter. org.apache.mahout.text.wikipedia.WikipediaXmlSplitter = wikipediaXmlSplitter :
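The short-name lookup described above can be sketched as follows. This is only an illustration, not Mahout's driver code: the functions `parse_driver_props` and `resolve` are hypothetical, the entry format is taken from the line quoted in the message, and the description after the colon is a placeholder because the original is cut off. The sketch also assumes the lookup is case-sensitive; if so, `wikipediaXMLSplitter` (as typed in the command) would not match an entry registered as `wikipediaXmlSplitter`.

```python
def parse_driver_props(text):
    """Parse lines shaped like 'fully.qualified.Class = shortName : description'.

    Hypothetical helper for illustration; it is not part of Mahout.
    Returns a dict mapping shortName -> fully qualified class name.
    """
    programs = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        class_name, sep, rest = line.partition("=")
        if not sep:
            continue  # not a key = value line
        short_name = rest.split(":", 1)[0].strip()
        programs[short_name] = class_name.strip()
    return programs


def resolve(programs, name):
    """Exact, case-sensitive lookup (an assumption about the driver's behaviour)."""
    return programs.get(name)


# Sample entry modelled on the one quoted in the thread; the description
# text is a placeholder because the original message is truncated there.
SAMPLE_PROPS = """
# illustrative excerpt only
org.apache.mahout.text.wikipedia.WikipediaXmlSplitter = wikipediaXmlSplitter : (description elided)
"""

if __name__ == "__main__":
    programs = parse_driver_props(SAMPLE_PROPS)
    print(resolve(programs, "wikipediaXmlSplitter"))
    print(resolve(programs, "wikipediaXMLSplitter"))
```

Under these assumptions, the second lookup finds nothing, which would be consistent with the driver falling back to treating the name as a class and failing with ClassNotFoundException.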

Re: mahout command

2014-03-07 Thread Mahmood Naderan
In fact, the file is src/conf/driver.classes.default.props, which is not exactly what you said. I still have the same problem. Please see the complete log: hadoop@solaris:~/mahout-distribution-0.9$ head -n 5 src/conf/driver.classes.default.props

PCA to improve classification performances

2014-03-07 Thread Kevin Moulart
Hi again, I'm now using Mahout 0.9, and I'm trying to use PCA (via the SSVD) to reduce the dimension of a dataset from 1600+ features to ~100 and then use the reduced dataset to train a naive Bayes model and test it. So here is my workflow: - Transform my CSV into a SequenceFile with
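For readers unfamiliar with the idea, the dimensionality-reduction step can be sketched on a toy scale as follows. This is not Mahout's SSVD (a stochastic SVD run as a Hadoop job); it swaps in plain power iteration with deflation on mean-centered rows, in pure Python, just to show what "project the data onto the top k principal components" means. All function names here are hypothetical.

```python
import math
import random


def center(rows):
    """Subtract the per-column mean from every row."""
    n, d = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(d)]
    return [[r[j] - means[j] for j in range(d)] for r in rows]


def top_component(rows, iters=200, seed=0):
    """Leading eigenvector of X^T X via power iteration (X = centered rows)."""
    rng = random.Random(seed)
    d = len(rows[0])
    v = [rng.uniform(-1.0, 1.0) for _ in range(d)]
    for _ in range(iters):
        xv = [sum(r[j] * v[j] for j in range(d)) for r in rows]            # X v
        w = [sum(rows[i][j] * xv[i] for i in range(len(rows))) for j in range(d)]  # X^T (X v)
        norm = math.sqrt(sum(x * x for x in w))
        if norm == 0.0:
            break  # no variance left in the (deflated) data
        v = [x / norm for x in w]
    return v


def pca(rows, k):
    """Return (reduced_rows, components) keeping the top k components."""
    centered = center(rows)
    work = [r[:] for r in centered]
    components = []
    for _ in range(k):
        v = top_component(work)
        components.append(v)
        # Deflate: remove the part of each row that lies along v.
        for r in work:
            s = sum(a * b for a, b in zip(r, v))
            for j in range(len(r)):
                r[j] -= s * v[j]
    reduced = [[sum(a * b for a, b in zip(r, v)) for v in components]
               for r in centered]
    return reduced, components


if __name__ == "__main__":
    # Toy data lying on the line y = 2x in three dimensions.
    data = [[float(t), 2.0 * t, 0.0] for t in range(-2, 3)]
    reduced, components = pca(data, 1)
    print(components[0])
    print(reduced)
```

The reduced rows, not the original ones, are what would then be fed to the classifier; at 1600+ features a distributed or library implementation would be used instead of this sketch.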

Re: PCA to improve classification performances

2014-03-07 Thread Suneel Marthi
It's not clear to me from your description what exact sequence of steps you are running through, but an SSVD job requires a matrix as input (not a sequencefile of Text, VectorWritables). When you try running seqdumper on your SSVD output, do you see anything? The next step after you create your

Re: mahout command

2014-03-07 Thread Mahmood Naderan
FYI, I am trying to complete the Wikipedia example from Apache's documentation: https://cwiki.apache.org/confluence/display/MAHOUT/Wikipedia+Bayes+Example Regards, Mahmood On Friday, March 7, 2014 5:23 PM, Mahmood Naderan nt_mahm...@yahoo.com wrote: In fact,  see this file

Re: mahout command

2014-03-07 Thread Suneel Marthi
The example as documented on the wiki should work. The issue seems to be that you are running a Mahout 0.9 distro that was built with the Hadoop 1.2.1 profile in a Hadoop 2.3 environment. I don't think that's going to work. I suggest that you either: a) Switch to a Hadoop 1.2.1 environment b) Work off of present

Re: mahout command

2014-03-07 Thread Mahmood Naderan
That is rather disappointing. "b) Work off of present Head and build with Hadoop 2.x profile." Can you explain more? Regards, Mahmood On Friday, March 7, 2014 8:09 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: The example as documented on the Wiki should work.  The issue u seem to

Welcome Andrew Musselman as new comitter

2014-03-07 Thread Sebastian Schelter
Hi, this is to announce that the Project Management Committee (PMC) for Apache Mahout has asked Andrew Musselman to become a committer and we are pleased to announce that he has accepted. Being a committer enables easier contribution to the project since in addition to posting patches on JIRA

Re: Welcome Andrew Musselman as new comitter

2014-03-07 Thread Suneel Marthi
Congrats Andrew. On Friday, March 7, 2014 12:13 PM, Sebastian Schelter s...@apache.org wrote: Hi, this is to announce that the Project Management Committee (PMC) for Apache Mahout has asked Andrew Musselman to become committer and we are pleased to announce that he has accepted. Being a

RE: Welcome Andrew Musselman as new comitter

2014-03-07 Thread Martin, Nick
Awesome! Congrats Andrew, very well deserved. -Original Message- From: Sebastian Schelter [mailto:s...@apache.org] Sent: Friday, March 07, 2014 12:13 PM To: user@mahout.apache.org; d...@mahout.apache.org Subject: Welcome Andrew Musselman as new comitter Hi, this is to announce that the

Re: Mahout with Storm/Spark

2014-03-07 Thread vineet yadav
Hi Ted, It is Mahout 0.7. Thanks, Vineet Yadav On Thu, Mar 6, 2014 at 11:58 PM, Ted Dunning ted.dunn...@gmail.com wrote: Which version are you using? On Thu, Mar 6, 2014 at 5:47 AM, vineet yadav vineet.yadav.i...@gmail.com wrote: Hi, I am using the Mahout LDA algorithm for Topic Modeling

Re: Welcome Andrew Musselman as new comitter

2014-03-07 Thread Frank Scholten
Congratulations Andrew! On Fri, Mar 7, 2014 at 6:12 PM, Sebastian Schelter s...@apache.org wrote: Hi, this is to announce that the Project Management Committee (PMC) for Apache Mahout has asked Andrew Musselman to become committer and we are pleased to announce that he has accepted.

Re: Welcome Andrew Musselman as new comitter

2014-03-07 Thread Kevin Moulart
Congratulations Andrew! Sent from Mailbox for iPhone On Fri, Mar 7, 2014 at 6:26 PM, Frank Scholten fr...@frankscholten.nl wrote: Congratulations Andrew! On Fri, Mar 7, 2014 at 6:12 PM, Sebastian Schelter s...@apache.org wrote: Hi, this is to announce that the Project Management Committee

Re: Welcome Andrew Musselman as new comitter

2014-03-07 Thread Andrew Musselman
Thank you for the welcome! Looking forward to it. I have a math background and got started with recommenders by building the first album recommender for Rhapsody ( http://rhapsody.com ) while I was doing web development and web services work for the service. Since then I learned to love/hate

Re: Mahout with Storm/Spark

2014-03-07 Thread Suneel Marthi
a) Upgrade to the latest Mahout version; please move away from 0.7, a lot of lint has been cleaned up since then. b) It seems like you are running the old LDA algorithm that was replaced by CVB in later versions; try running your corpus through CVB once you upgrade to a later version of Mahout. I don't think

Re: Welcome Andrew Musselman as new comitter

2014-03-07 Thread Pavan Kumar N
Congratulations to Andrew. It would be nice to have some information/background on how the PMC evaluated Andrew to become a committer. It would also be nice to know which future aspects/algorithms of machine learning Mahout is going to focus on. I have been keen to maintain code for one of the projects and

Re: Welcome Andrew Musselman as new comitter

2014-03-07 Thread Scott C. Cote
I personally am looking forward to the “advice” from the newest “recommended” committer to hadoop. Congratulations to Mahout team for increasing and growing :) Now back to my using .... (and hopefully creating something meaningful for you guys) Scott PS: am bootstrapping my Machine Learning

Re: Welcome Andrew Musselman as new comitter

2014-03-07 Thread Gokhan Capan
Welcome and congratulations, Andrew! Sent from my iPhone On Mar 7, 2014, at 20:20, Scott C. Cote scottcc...@gmail.com wrote: I personally am looking forward to the “advice” from the newest “recommended” committer to hadoop. Congratulations to Mahout team for increasing and growing :) Now

RE: Welcome Andrew Musselman as new comitter

2014-03-07 Thread Saikat Kanjilal
Congrats Andrew. I've taken the Coursera course; it was interesting, but I was hoping it would cover more in the area of deep learning. Date: Fri, 7 Mar 2014 12:19:52 -0600 Subject: Re: Welcome Andrew Musselman as new comitter From: scottcc...@gmail.com To: user@mahout.apache.org I

Re: Welcome Andrew Musselman as new comitter

2014-03-07 Thread Stevo Slavić
Congratulations and welcome to the team Andrew! On Fri, Mar 7, 2014 at 7:28 PM, Saikat Kanjilal sxk1...@hotmail.com wrote: Congrats Andrew, I've taken the coursera course, it was interesting but was hoping it could cover some more in the area of deep learning. Date: Fri, 7 Mar 2014