Thanks Andy! Would like to see if there is interest in algorithms such as 1) clustering text in an online fashion (maybe using LSH or sim/min hash) or 2) online clustering of time series. Basically my focus is "online" or real-time.
LSH on GPU sounds very interesting and would love to look at the patches. Personally have helped accelerate LSH on TCAMs long ago (e.g. http://arxiv.org/abs/1006.3514). Is GPU the only hw accel you are looking at, or are you considering PCIe FPGA cards too?

debo

On Tue, May 17, 2016 at 5:27 PM, Andrew Palumbo <[email protected]> wrote:

> Welcome, Debojyoti.
> We look forward to your contributions. We are currently working towards
> integrating GPU acceleration for our 0.13 release and LSH sounds like a
> great addition. Could you tell us some more about what you would like to do?
>
> Let us know if we can help you get familiar with the Mahout code base. We
> try to implement algorithms in the math-scala module.
>
> Thanks,
>
> Andy
>
> -------- Original message --------
> From: Debojyoti Dutta <[email protected]>
> Date: 05/17/2016 8:11 PM (GMT-05:00)
> To: [email protected]
> Subject: [NEW member] Hi
>
> Hi there,
>
> Am very interested in contributing to Mahout especially towards fast ML
> kernels that can be used for streaming. Have some experience with LSH based
> techniques (including hw accel) for clustering and near neighbors based
> stuff in general.
>
> Was chatting with Sunil and he suggested I join the merry band.
>
> regards
> -Debo~

--
-Debo~
