Re: DenseRowMatrix?

2013-04-18 Thread Gokhan Capan
Ok, I'll do that. Thank you, Jake. On Thu, Apr 18, 2013 at 4:43 AM, Jake Mannix jake.man...@gmail.com wrote: SparseMatrix is implemented as a MapInteger, Vector, you could modify that class to allow you to chose between dense or sparse rows at construction time. On Wed, Apr 17, 2013 at

Re: Feature reduction for LibLinear weights

2013-04-18 Thread Ted Dunning
On Wed, Apr 17, 2013 at 2:29 PM, Ken Krugler kkrugler_li...@transpac.comwrote: Though I haven't yet found a good write-up on the value of generating more than one hash - seems like multiple hash values would increase the odds of collisions. It does. But it also increases the chances of

Re: DenseRowMatrix?

2013-04-18 Thread Ted Dunning
Actually, I built a BlockSparseMatrix some time ago that does this. It was just like our current row sparse matrix except that it allowed for blocks instead of rows. It did assume sparse matrices where the data existed. Worked a charm on the app in question. It doesn't appear that I committed

Re: DenseRowMatrix?

2013-04-18 Thread Gokhan Capan
Ted, Are you going to commit this? On Thu, Apr 18, 2013 at 5:18 PM, Ted Dunning ted.dunn...@gmail.com wrote: Actually, I built a BlockSparseMatrix some time ago that does this. It was just like our current row sparse matrix except that it allowed for blocks instead of rows. It did assume

Re: DenseRowMatrix?

2013-04-18 Thread Ted Dunning
I hadn't planned to since I don't have cycles to properly test it. Let me see if I can dig it back up to see if anybody else wants to. SparseMatrix may be good enough for you. On Thu, Apr 18, 2013 at 2:48 PM, Gokhan Capan gkhn...@gmail.com wrote: Ted, Are you going to commit this? On

Re: DenseRowMatrix?

2013-04-18 Thread Ted Dunning
See MAHOUT-1193. Feel free to grab that code, torture it, and tell us if it is any good. If it is useful, we can commit it. On Thu, Apr 18, 2013 at 3:34 PM, Ted Dunning ted.dunn...@gmail.com wrote: I hadn't planned to since I don't have cycles to properly test it. Let me see if I can dig

Error creating assembly archive job: error in opening zip file

2013-04-18 Thread Philipp Defner
Hello, I'm having some problems with installing Mahout. I setup a clean Ubuntu 12.10 x64 VM with: java -version == java version 1.6.0_27 OpenJDK Runtime Environment (IcedTea6 1.12.3) (6b27-1.12.3-0ubuntu1~12.10.1) OpenJDK 64-Bit Server VM (build 20.0-b12, mixed mode) mvn -version

Re: Error creating assembly archive job: error in opening zip file

2013-04-18 Thread Suneel Marthi
Usually happens if Maven looses connection to the repo from my experience, could u try again? Also r u using Maven 3? From: Philipp Defner m...@notmyhostna.me To: user@mahout.apache.org Sent: Thursday, April 18, 2013 5:22 PM Subject: Error creating assembly

Re: Error creating assembly archive job: error in opening zip file

2013-04-18 Thread Dmitriy Lyubimov
maven 2.2.1 is really archaic at this point methinks. I don't think anybody has been checking if stuff compiles with it for a really long time now. -d On Thu, Apr 18, 2013 at 2:22 PM, Philipp Defner m...@notmyhostna.me wrote: Hello, I'm having some problems with installing Mahout. I setup a

Re: Error creating assembly archive job: error in opening zip file

2013-04-18 Thread Philipp Defner
I'm using Apache Maven 2.2.1 and I don't think it's losing the connection because it's the same error across different servers. On Apr 18, 2013, at 11:31 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Usually happens if Maven looses connection to the repo from my experience, could u try

Re: Error creating assembly archive job: error in opening zip file

2013-04-18 Thread Suneel Marthi
Well I guess its maven 2.2.1 then, upgrade to maven 3+ and give it a shot. From: Philipp Defner m...@notmyhostna.me To: user@mahout.apache.org; Suneel Marthi suneel_mar...@yahoo.com Sent: Thursday, April 18, 2013 5:36 PM Subject: Re: Error creating assembly

Re: Error creating assembly archive job: error in opening zip file

2013-04-18 Thread Philipp Defner
I updated maven to 3.0.4 now but the problem is still around. == Apache Maven 3.0.4 Maven home: /usr/share/maven Java version: 1.6.0_27, vendor: Sun Microsystems Inc. Java home: /usr/lib/jvm/java-6-openjdk-amd64/jre Default locale: en_US, platform encoding: UTF-8 OS name: linux,

Re: Error creating assembly archive job: error in opening zip file

2013-04-18 Thread Dmitriy Lyubimov
Hm. This is really not a known error. Which suggests something really platitudinarian: open file handle limits? lack of disk space? Sorry if that's not really helpful but it is not something i can repeat. On Thu, Apr 18, 2013 at 3:48 PM, Philipp Defner m...@notmyhostna.me wrote: I updated

Re: Error creating assembly archive job: error in opening zip file

2013-04-18 Thread Sean Owen
Probably a corrupt download inside Maven. Delete ~/.m2/repository entirely On Apr 19, 2013 12:23 AM, Dmitriy Lyubimov dlie...@gmail.com wrote: Hm. This is really not a known error. Which suggests something really platitudinarian: open file handle limits? lack of disk space? Sorry if that's not

Re: Error creating assembly archive job: error in opening zip file

2013-04-18 Thread Philipp Defner
Alright, this solved the problem. Thanks Sean! I actually removed the .m2/ directory a few times already (because it was suggested at various places) but forgot about it after I installed maven3 so I guess it's the combination of installing maven 3.x and running a clean install without the old

What's the difference between trainnb and trainclassifier -type bayes?

2013-04-18 Thread Ryan Compton
When I use trainclassifier I am able to run the 20 news groups just fine. I'm also able to train on my own data up until around 10M training documents. Once I have enough training data, I find that trainclassifier succeeds and testclassifier fails. I have no idea if it was a training or testing

Re: What's the difference between trainnb and trainclassifier -type bayes?

2013-04-18 Thread Robin Anil
train classifier was the old implementation. I am assuming you are using a version 0.7. You could either try 0.7 or get the latest source from svn the new implementation works both with seq2sparse (the one that generates tfidf vectors) and the new seq2encoded(the one that generates vectors using