RE: RE: how to use a custom distance measure with kmeans?

2013-02-19 Thread Mihai Josan
Hello, I managed to resolve the problem without modifying the Mahout script. I inserted my classes into the mahout job jar (mahout-examples-0.7-cdh4.1.2-job.jar) and everything is ok now. Thank you very much for your help, Mihai Josan -Original Message- From: Mihai Josan

Re: What will be the LDAPrintTopics compatible/equivalent feature in Mahout-0.7?

2013-02-19 Thread 万代豊
Well, the --sortVectors option of the vectordump utility, which I used to evaluate the CVB clustering result, unfortunately gave me an OutOfMemory error... Here is a case that seems to go well without the --sortVectors option. $ $MAHOUT_HOME/bin/mahout vectordump -i NHTSA-LDA-sparse -d

Re: OnlineSummarizer and OnlineGaussianAccumulator

2013-02-19 Thread Ted Dunning
I don't think that there are any important differences. On Tue, Feb 19, 2013 at 3:42 AM, Dan Filimon dangeorge.fili...@gmail.com wrote: Are there any differences between these two classes except that OnlineSummarizer is used for scalars and OnlineGaussianAccumulator for vectors? They link to
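
For readers unfamiliar with the scalar-versus-vector distinction Dan asks about, here is a minimal sketch of a streaming mean/variance accumulator in both forms. It uses Welford's update, which is an assumption chosen for illustration; it is not taken from Mahout's source, and the class names `RunningStats` and `RunningVectorStats` are hypothetical.

```python
class RunningStats:
    """Streaming mean and sample variance for scalars (Welford's update).

    Illustrative only: not Mahout's OnlineSummarizer.
    """

    def __init__(self):
        self.n = 0
        self.mean = 0.0
        self.m2 = 0.0  # running sum of squared deviations from the mean

    def add(self, x):
        self.n += 1
        delta = x - self.mean
        self.mean += delta / self.n
        # second factor uses the updated mean, which keeps the sum stable
        self.m2 += delta * (x - self.mean)

    def variance(self):
        # sample variance; undefined for fewer than two observations
        return self.m2 / (self.n - 1) if self.n > 1 else float("nan")


class RunningVectorStats:
    """The same update applied independently to each vector component.

    Illustrative only: not Mahout's OnlineGaussianAccumulator.
    """

    def __init__(self, dim):
        self.components = [RunningStats() for _ in range(dim)]

    def add(self, vec):
        for stats, x in zip(self.components, vec):
            stats.add(x)

    def mean(self):
        return [s.mean for s in self.components]

    def variance(self):
        return [s.variance() for s in self.components]
```

In this sketch the vector version is only a per-component wrapper around the scalar one, which matches the thread's suggestion that the two differ mainly in the type of input they accept.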

Re: Problems Running Mahout SSVD

2013-02-19 Thread K.D.P. Ross
Just to follow up: I now have my real data, which is much sparser than the totally-random data … and, unsurprisingly, it exhibits a good bit more regularity, so it's compressible to the point that the on-disc SequenceFile is small enough that there's only a single map job, which, of course, means

Re: Problems Running Mahout SSVD

2013-02-19 Thread Dmitriy Lyubimov
Well, even with sparse data, your problem is probably still quite small for this. Btw, if I have time I will probably port this method to Spark RDDs and Bagel, which should speed things up by removing some inevitable sorting overhead. In fact, methinks, having Mahout sparse vectors and matrices as

Re: Recommendation based on the attributes

2013-02-19 Thread Ted Dunning
Yes. Recommend the attributes and then do a search for items with those attributes. You can also combine the final retrieval with a normal retrieval. On Tue, Feb 19, 2013 at 9:53 PM, Mohit Anchlia mohitanch...@gmail.com wrote: Is it currently possible to use Mahout Recommendation algorithm
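
The two-step approach Ted describes (recommend attributes, then search for items carrying them) could be sketched roughly as follows. Everything here is hypothetical: the recommender output is assumed to be a list of `(attribute, weight)` pairs, the search is a simple in-memory inverted index rather than a real search engine, and the function names are made up for illustration.

```python
from collections import defaultdict


def build_attribute_index(item_attributes):
    """Build an inverted index: attribute -> set of items that have it."""
    index = defaultdict(set)
    for item, attrs in item_attributes.items():
        for attr in attrs:
            index[attr].add(item)
    return index


def search_items(recommended_attributes, index, top_n=10):
    """Score each item by the summed weight of the recommended attributes it has.

    recommended_attributes: iterable of (attribute, weight) pairs, assumed to
    come from a recommender that was trained on user-attribute interactions.
    """
    scores = defaultdict(float)
    for attr, weight in recommended_attributes:
        for item in index.get(attr, ()):
            scores[item] += weight
    # highest score first; ties broken by item id for a stable order
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return ranked[:top_n]
```

A small usage example with made-up data:

```python
catalog = {
    "shoe_a": {"red", "leather"},
    "shoe_b": {"red"},
    "bag_c": {"canvas"},
}
index = build_attribute_index(catalog)
results = search_items([("red", 0.5), ("leather", 0.25)], index)
```

Combining this with "a normal retrieval", as the reply suggests, would amount to merging these scores with those of an ordinary query; how to weight the two is left open here.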

Fwd: Precision used by mahout

2013-02-19 Thread shruti ranade
Hi, I am a beginner in Mahout. I am working on the k-means MR implementation and trying to run it on a GPGPU. I wanted to know whether Mahout computations are all double precision or single precision. Please suggest any documentation that I should refer to. Thanks, Shruti