Marko,

Sorry to be non-responsive.

There is not a good user manual for the streaming k-means software, and there are some known scaling pathologies with that code. I know something about it myself, but currently lack the time to provide detailed support.

Can you remind me what your interest is? Is this research? Or are you looking for something more industrial?

On Wed, Sep 24, 2014 at 8:34 AM, Marko <[email protected]> wrote:

> Hello everyone,
>
> I'm very sorry to bump in like this, I have been added to the mail list (I
> think), but it seems that I'm somehow unable to ask a question, that is, I
> asked a question full times and got no answer. I hope this way will work.
>
> I'm new to Mahout and I've been struggling with Streaming K-means for a
> while now. Is there any tutorial or example of how to use it, how to get
> results, how to call clustering function?
>
> Any help would be great,
> Thanks
>
> On 24.09.2014. 15:14, Arian Pasquali wrote:
>
>> Yes,
>> I'm studying his work <http://nlp.uned.es/~jperezi/Lucene-BM25/> and the
>> current mahout's tfidf code.
>> Trying to understand how I would port that to mr.
>> I ll try to share something if I succeed.
>>
>> Arian Pasquali
>> http://about.me/arianpasquali
>>
>> 2014-09-24 5:12 GMT+01:00 Suneel Marthi <[email protected]>:
>>
>>> Lucene 4.x supports okapi-bm25. So it should be easy to implement.
>>>
>>> On Tue, Sep 23, 2014 at 11:57 PM, Ted Dunning <[email protected]>
>>> wrote:
>>>
>>>> Should be pretty easy. I haven't heard of anyone doing it.
>>>>
>>>> Sent from my iPhone
>>>>
>>>> On Sep 23, 2014, at 18:53, Arian Pasquali <[email protected]>
>>>> wrote:
>>>>
>>>>> Hi,
>>>>> I was wondering if would be possible to support bm25 term weighting
>>>>> extending Mahout's tf-idf implementation.
>>>>>
>>>>> I was curious to know if anyone here has already tried to do so.
>>>>> If not, what would be your suggestion for such implementation on
>>>>> Mahout?
>>>>>
>>>>> Arian Pasquali
>>>>> http://about.me/arianpasquali
