Re: Mahout clustering

2014-12-16 Thread 万代豊
Hi Shweta I guess I can handle this. I always specify namedVector option when generation term vector(seq2sparse) as; $MAHOUT_HOME/bin/mahout seq2sparse --namedVector -i MyJob/MyJob-seqfile/ -o MyJob/MyJob-namedVector -ow -a org.apache.lucene.analysis.WhitespaceAnalyzer -chunk 200 -wt tfidf -s 5 -m

Advise needed for Mahout heap size allocation (seq2sparse failure)

2014-12-16 Thread 万代豊
Hi After my several successful jobs experiences on other Mahout Kmeans calculation in the past , I'm facing a sudden heap error as below in Mahout seq2sparse process.(Mahout-0.70 on Hadoop-0.20.203 Pseudo-distributed) [hadoop@localhost TEST]$ $MAHOUT_HOME/bin/mahout seq2sparse --namedVector -i TES

Re: Question about choice of a recommender

2014-12-16 Thread Ted Dunning
How much data are you going to be collecting? How many users and how many presentations per user? Are you saying that the product for each video are completely fixed? Does the same product appear for more than one video? Do users interact with products outside of the narrow confines that you ha

Re: How can I include mahout 0.9 with hadoop 2.3 in my project?

2014-12-16 Thread Lee S
yep, I just read the pom.xml carefully, you are right, -Dhadoop2 is redundant. 2014-12-16 21:24 GMT+08:00 Gokhan Capan : > > I believe -Dhadoop2 is also redundant. > > mvn clean install -Dhadoop.version=2.3.0 should be sufficient > > Sent from my iPhone > > > On Dec 16, 2014, at 12:29, Lee S wrot

RE: Providing classification labels to Naive Bayes

2014-12-16 Thread Andrew Palumbo
Hi Suman, Attachments don't come through on the user list. Would you mind starting a Jira issue for this with an small example of your data and the error that you're receiving? This may be a feature that was not fully implemented in the most recent MapReduce version of Naive Bayes. Thanks,

RE: Providing classification labels to Naive Bayes

2014-12-16 Thread Suman Somasundar
Hi, Attached is the sample dataset. I using the latest version of Mahout (downloaded from git). Thanks, Suman. -Original Message- From: jyotiranjan panda [mailto:tell2jy...@gmail.com] Sent: Sunday, December 14, 2014 7:23 PM To: user@mahout.apache.org Subject: Re: Providing classificati

Re: How can I include mahout 0.9 with hadoop 2.3 in my project?

2014-12-16 Thread Gokhan Capan
I believe -Dhadoop2 is also redundant. mvn clean install -Dhadoop.version=2.3.0 should be sufficient Sent from my iPhone > On Dec 16, 2014, at 12:29, Lee S wrote: > > Hi all , I have figured this out. > The command should be (mvn clean install -Dhadoop2 -Dhadoop.version=2.3.0 > -DskipTests=true

Re: How can I include mahout 0.9 with hadoop 2.3 in my project?

2014-12-16 Thread Lee S
Hi all , I have figured this out. The command should be (mvn clean install -Dhadoop2 -Dhadoop.version=2.3.0 -DskipTests=true ). Because (2.2.0) is in the pom.xml, not hadoop2.version. Hope this can help somebody who meets the same problem. 2014-12-16 15:49 GMT+08:00 Lee S : > > I compiled mahout w