On Wed, Jun 18, 2014 at 5:29 PM, Dmitriy Lyubimov <dlie...@gmail.com> wrote:
> > Finally, I don't view our mission as limited to the DSL work. We should > > also accept/generate useful applications of the DSL. > > > > > I thought that's what i said as well. Cool. I had thought that this was consensus, but got a different impression from you last comments. All's well that ends with consensus to misquote the Bard. > I'd be happy if we could demonstrate > a custom e2e application using mahout components for feature extraction, > vectorization, solution and postprocessing in a few month. In that sense > the woefully missing stuff here is feature extraction and frames. > Indeed. I am seriously hoping that my day job drops below 60 hours soon so I can spend some time on the feature extraction part. > real time streaming such as Spark streaming would requite a bit more > thinking, it is not clear to me how we could abstract such capabilities in > engine-independent way at this point, and even if there's much merit in > doing that rather than writing algorithms directly for streaming, if that's > the idea here. But streaming algorithms still can mix in other mahout > components, which would make them quasi-mahout algorithms of second kind i > mentioned. Yeah... I think that we have agreement here as well ... the Spark Streaming application writers are more likely to be consumers of Mahout than suppliers of additional algorithms.