On Wed, Jun 18, 2014 at 4:58 PM, Ted Dunning <ted.dunn...@gmail.com> wrote:
> My own take is quite similar but a little different. > > ... > > Finally, I don't view our mission as limited to the DSL work. We should > also accept/generate useful applications of the DSL. > > I thought that's what i said as well. I'd be happy if we could demonstrate a custom e2e application using mahout components for feature extraction, vectorization, solution and postprocessing in a few month. In that sense the woefully missing stuff here is feature extraction and frames. real time streaming such as Spark streaming would requite a bit more thinking, it is not clear to me how we could abstract such capabilities in engine-independent way at this point, and even if there's much merit in doing that rather than writing algorithms directly for streaming, if that's the idea here. But streaming algorithms still can mix in other mahout components, which would make them quasi-mahout algorithms of second kind i mentioned.