@Marko, Subject: Streaming KMeans See http://stackoverflow.com/questions/17272296/how-to-use-mahout-streaming-k-means/18090471#18090471 for how to invoke Streaming Kmeans
Also look at examples/bin/cluster-reuters.sh for the Streaming KMeans option. On Wed, Sep 24, 2014 at 11:34 AM, Marko <[email protected]> wrote: > Hello everyone, > > I'm very sorry to bump in like this, I have been added to the mail list (I > think), but it seems that I'm somehow unable to ask a question, that is, I > asked a question full times and got no answer. I hope this way will work. > > I'm new to Mahout and I've been struggling with Streaming K-means for a > while now. Is there any tutorial or example of how to use it, how to get > results, how to call clustering function? > > Any help would be great, > Thanks > > > On 24.09.2014. 15:14, Arian Pasquali wrote: > >> Yes, >> I'm studying his work <http://nlp.uned.es/~jperezi/Lucene-BM25/> and the >> current mahout's tfidf code. >> Trying to understand how I would port that to mr. >> I ll try to share something if I succeed. >> >> >> >> >> >> Arian Pasquali >> http://about.me/arianpasquali >> >> 2014-09-24 5:12 GMT+01:00 Suneel Marthi <[email protected]>: >> >> Lucene 4.x supports okapi-bm25. So it should be easy to implement. >>> >>> On Tue, Sep 23, 2014 at 11:57 PM, Ted Dunning <[email protected]> >>> wrote: >>> >>> Should be pretty easy. I haven't heard of anyone doing it. >>>> >>>> Sent from my iPhone >>>> >>>> On Sep 23, 2014, at 18:53, Arian Pasquali <[email protected]> >>>>> >>>> wrote: >>>> >>>>> Hi, >>>>> I was wondering if would be possible to support bm25 term weighting >>>>> extending Mahout's tf-idf implementation. >>>>> >>>>> I was curious to know if anyone here has already tried to do so. >>>>> If not, what would be your suggestion for such implementation on >>>>> >>>> Mahout? >>> >>>> >>>>> Arian Pasquali >>>>> http://about.me/arianpasquali >>>>> >>>> >
