Hi Shweta
I guess I can handle this.
I always specify namedVector option when generation term vector(seq2sparse)
as;
$MAHOUT_HOME/bin/mahout seq2sparse --namedVector -i MyJob/MyJob-seqfile/ -o
MyJob/MyJob-namedVector -ow -a
org.apache.lucene.analysis.WhitespaceAnalyzer -chunk 200 -wt tfidf -s 5 -m
Hi
After my several successful jobs experiences on other Mahout Kmeans
calculation in the past , I'm facing a sudden heap error as below in Mahout
seq2sparse process.(Mahout-0.70 on Hadoop-0.20.203 Pseudo-distributed)
[hadoop@localhost TEST]$ $MAHOUT_HOME/bin/mahout seq2sparse --namedVector
-i TES
How much data are you going to be collecting? How many users and how many
presentations per user?
Are you saying that the product for each video are completely fixed? Does
the same product appear for more than one video?
Do users interact with products outside of the narrow confines that you
ha
yep, I just read the pom.xml carefully, you are right, -Dhadoop2 is
redundant.
2014-12-16 21:24 GMT+08:00 Gokhan Capan :
>
> I believe -Dhadoop2 is also redundant.
>
> mvn clean install -Dhadoop.version=2.3.0 should be sufficient
>
> Sent from my iPhone
>
> > On Dec 16, 2014, at 12:29, Lee S wrot
Hi Suman,
Attachments don't come through on the user list. Would you mind starting a
Jira issue for this with an small example of your data and the error that
you're receiving? This may be a feature that was not fully implemented in the
most recent MapReduce version of Naive Bayes.
Thanks,
Hi,
Attached is the sample dataset.
I using the latest version of Mahout (downloaded from git).
Thanks,
Suman.
-Original Message-
From: jyotiranjan panda [mailto:tell2jy...@gmail.com]
Sent: Sunday, December 14, 2014 7:23 PM
To: user@mahout.apache.org
Subject: Re: Providing classificati
I believe -Dhadoop2 is also redundant.
mvn clean install -Dhadoop.version=2.3.0 should be sufficient
Sent from my iPhone
> On Dec 16, 2014, at 12:29, Lee S wrote:
>
> Hi all , I have figured this out.
> The command should be (mvn clean install -Dhadoop2 -Dhadoop.version=2.3.0
> -DskipTests=true
Hi all , I have figured this out.
The command should be (mvn clean install -Dhadoop2 -Dhadoop.version=2.3.0
-DskipTests=true ).
Because (2.2.0) is in the pom.xml, not
hadoop2.version.
Hope this can help somebody who meets the same problem.
2014-12-16 15:49 GMT+08:00 Lee S :
>
> I compiled mahout w