Re: Fwd: PCA with ssvd leads to StackOverFlowError

2014-03-06 Thread Kevin Moulart
Hi again, and thanks for the enthusiasm! I did compile the trunk with the hadoop2 profile and, although it didn't work at first because some Canopy tests were not passing, it compiled once I skipped the tests, and when I tested it afterward it passed. I used the version I have installed, so I just

Mahout with Storm/Spark

2014-03-06 Thread vineet yadav
Hi, I am using the Mahout LDA algorithm for topic modeling on a huge number of documents (500k or more). Mahout is taking a lot of time, so I am looking at other alternatives. I found the link (http://www.oracle.com/technetwork/articles/java/micro-1925135.html), where Storm is used with Mallet for real time

Re: Rework our website

2014-03-06 Thread Kevin Moulart
Hi, I also prefer the second one. While I'm at it, there are several links that point to absent pages. I just clicked on all the links present on the page http://mahout.apache.org/users/basics/quickstart.html and these links are broken:

Re: Rework our website

2014-03-06 Thread Sebastian Schelter
Thank you very much! Could you create a jira ticket and post the links there? That would be awesome, then we can track that this stuff gets fixed. Best, Sebastian On 03/06/2014 02:58 PM, Kevin Moulart wrote: Hi I also prefer the second one. While I'm at it, there are several links that point

Re: Rework our website

2014-03-06 Thread Kevin Moulart
Here you go: https://issues.apache.org/jira/browse/MAHOUT-1434 First JIRA issue posted, so please tell me if I did something wrong in choosing the categories or anything. 2014-03-06 15:06 GMT+01:00 Sebastian Schelter s...@apache.org: Thank you very much! Could you create a jira ticket and

Re: Rework our website

2014-03-06 Thread Suneel Marthi
I fixed some of the broken links. For some of the others, e.g. TasteCommandline and Recommendationexamples, either the pages have not been migrated or the links have to be purged? On Thursday, March 6, 2014 9:07 AM, Sebastian Schelter s...@apache.org wrote: Thank you very much! Could you create a

Re: Rework our website

2014-03-06 Thread Sebastian Schelter
Could you add the missing pages to the jira issue? I'll have a look later. On 03/06/2014 03:25 PM, Suneel Marthi wrote: I fixed some of the broken links. For some of others eg: TasteCommandline, Recommendationexamples either the pages have not been migrated or the links have to be purged?

Re: Fwd: PCA with ssvd leads to StackOverFlowError

2014-03-06 Thread Gokhan Capan
Kevin, From trunk, can you build Mahout for hadoop2 using this command: mvn clean package -DskipTests=true -Dhadoop2.version=YOUR_HADOOP2_VERSION Then can you verify that you have the right hadoop jars with the following command: find . -name "hadoop*.jar" Gokhan On Thu, Mar 6, 2014 at

Re: Rework our website

2014-03-06 Thread Suneel Marthi
There is stuff that needs to be migrated over from the old Web site. See Jira for the details. On Thursday, March 6, 2014 9:45 AM, Sebastian Schelter s...@apache.org wrote: Could you add the missing pages to the jira issue? I'll have a look later. On 03/06/2014 03:25 PM, Suneel Marthi

Re: Fwd: PCA with ssvd leads to StackOverFlowError

2014-03-06 Thread Kevin Moulart
Hi, thanks very much, it seems to have worked! Compiling with mvn clean package -Dhadoop2.version=2.0.0-cdh4.6.0 works and I no longer have the error, but then when running tests that used to work with the previous install, like trainAdaptativeLogistic and then ValidateAdaptativeLogistic, the first

Re: Fwd: PCA with ssvd leads to StackOverFlowError

2014-03-06 Thread Sean Owen
That's gonna be a Guava version problem. I have seen variants of this for a while. Hadoop still uses 11.0.2 even in HEAD and you can often get away with using a later version in a project like this, even though code that executes on Hadoop will use an older Guava than you compiled against. This is

Re: Fwd: PCA with ssvd leads to StackOverFlowError

2014-03-06 Thread Kevin Moulart
OK, so should I try to recompile and change the Guava version to 11.0.2 in the pom? Kévin Moulart 2014-03-06 16:26 GMT+01:00 Sean Owen sro...@gmail.com: That's gonna be a Guava version problem. I have seen variants of this for a while. Hadoop still uses 11.0.2 even in HEAD and you can often

Re: Fwd: PCA with ssvd leads to StackOverFlowError

2014-03-06 Thread Sean Owen
If I'm right, then it will cause compile errors, but then you just fix those by replacing some Guava constructs with equivalent Java or older Guava code. IIRC it is fairly trivial. And in fact the code probably should not use Guava 12+ methods for this reason even if compiling against 12+. And in fact I

Re: Fwd: PCA with ssvd leads to StackOverFlowError

2014-03-06 Thread Kevin Moulart
Indeed, it causes compile errors: [ERROR] Failed to execute goal org.apache.maven.plugins:maven-compiler-plugin:3.1:compile (default-compile) on project mahout-math: Compilation failure [ERROR] /home/myCompny/Downloads/mahout9/math/src/main/java/org/apache/mahout/math/stats/GroupTree.java:[171,31]

Re: Mahout with Storm/Spark

2014-03-06 Thread Ted Dunning
Which version are you using? On Thu, Mar 6, 2014 at 5:47 AM, vineet yadav vineet.yadav.i...@gmail.com wrote: Hi, I am using Mahout LDA algorithm for Topic Modeling on a huge number of documents (500k or more). Mahout is taking a lot of time, I am looking at other alternatives. I found the link (

Re: Fwd: PCA with ssvd leads to StackOverFlowError

2014-03-06 Thread Ted Dunning
On Thu, Mar 6, 2014 at 7:46 AM, Kevin Moulart kevinmoul...@gmail.com wrote: [ERROR] /home/myCompny/Downloads/mahout9/math/src/main/java/org/apache/mahout/math/stats/GroupTree.java:[171,31] cannot find symbol Replace that line with: stack = new ArrayDeque<GroupTree>();
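
The reply above swaps the failing line for a plain JDK collection. A minimal sketch of that pattern follows, assuming the failing line called a Guava collection factory that Guava 11.0.2 does not provide (the thread does not show the original line). The class names DequeWithoutGuava and Node are hypothetical stand-ins, not Mahout code.

```java
import java.util.ArrayDeque;
import java.util.Deque;

public class DequeWithoutGuava {
    // Hypothetical stand-in for Mahout's GroupTree; it exists only so the sketch compiles.
    static final class Node {
        final int value;

        Node(int value) {
            this.value = value;
        }
    }

    // Creates the stack with the JDK constructor instead of a Guava factory method,
    // so the code compiles and runs regardless of the Guava version on the classpath.
    static Deque<Node> newStack() {
        return new ArrayDeque<Node>();
    }

    public static void main(String[] args) {
        Deque<Node> stack = newStack();
        stack.push(new Node(1));
        stack.push(new Node(2));
        // push/pop on a Deque gives last-in, first-out order.
        System.out.println(stack.pop().value);
    }
}
```

Since java.util.ArrayDeque has shipped with the JDK since Java 6, this form should not depend on which Guava version Hadoop puts on the runtime classpath.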

Re: Rework our website

2014-03-06 Thread Scott C. Cote
OK, I expected (and am actually pleased) that it's not a free-for-all. I’ll see what has already been updated in this latest flurry of updates and see what I can contribute. Forwarded to you. Thanks, Scott On 3/5/14, 4:43 PM, Sebastian Schelter s...@apache.org wrote: At the moment, only

Reuters Example LDA Error (no help anywhere)

2014-03-06 Thread Cosmin Dumbrava
I don't know if it is OK to mail this address like this, but... I have executed cluster-reuters.sh from the example directory (version 1.0 SNAPSHOT) and at the end I only get a list of . 21575{0.02:0.6314297270431626,0.03:

Re: Reuters Example LDA Error (no help anywhere)

2014-03-06 Thread Suneel Marthi
The script needs to be corrected to not call vectordump for LDA, as the vectordump utility (and even clusterdump) is presently not capable of displaying topics and relevant documents. I recall this issue was previously reported by Peyman Faratin post 0.9 release. Ideally Mahout's missing a

[blog post] Comparing Document Classification Functions of Lucene and Mahout

2014-03-06 Thread Koji Sekiguchi
Hello, I just posted an article on "Comparing Document Classification Functions of Lucene and Mahout": http://soleami.com/blog/comparing-document-classification-functions-of-lucene-and-mahout.html Comments are welcome. :) Thanks! koji