UseConcMarkSweepGC with Mahout

2013-07-02 Thread Aleksei Udatšnõi
JVM tuning section of Mahout in Action book recommends to use the following GC instead of the default one: -XX:+UseParallelGC -XX:+UseParallelOldGC. Has anyone tries running servlets based on Mahout with UseConcMarkSweepGC instead? Latter garbage collector is known for handling PermGen in

Re: UseConcMarkSweepGC with Mahout

2013-07-02 Thread Sean Owen
This is old-ish advice. I tend to favor UseParallelOldGC even on Java 7, over G1GC, even though it may even be a default now? The Old just means it also uses a parallel collector thread on the old generation. In general it's good to make use of increasingly multi-core machines by making GC

PCA using Java Code

2013-07-02 Thread Chirag Lakhani
Hello, I am trying to use the Mahout/Java API to do PCA but I am confused about the write order to do things. To start, I have a list of DenseVectors that I am reading into the code and turning it into a distributed matrix in the following form. DistributedRowMatrix m = new

Re: PCA using Java Code

2013-07-02 Thread Dmitriy Lyubimov
On Tue, Jul 2, 2013 at 1:52 PM, Chirag Lakhani clakh...@zaloni.com wrote: Hello, I am trying to use the Mahout/Java API to do PCA but I am confused about the write order to do things. To start, I have a list of DenseVectors that I am reading into the code and turning it into a distributed

Re: Need help on Mahout

2013-07-02 Thread Sugato Samanta
Hello, Can you please help? I am not able to read data from hadoop while using the package *org.apache.mahout.classifier.sgd.TrainLogistic *but i am able to read from native file system. Is there a way to read data from hadoop and do logistic regression? Thanks, Sugato On Mon, Jul 1, 2013 at