Re: machine learning libraries supported

2017-12-07 Thread Simon Elliston Ball
Spark’s ML models are primarily batch in their nature. There is talk about incorporating things like naive bayes and streaming kmeans to structured streaming (which will require some schema work in metron to make sense). These are still open issues not seeing a lot of progress in the spark

Re: machine learning libraries supported

2017-12-07 Thread Otto Fowler
Simon, What do you think a good example of python, spark and MaaS would look like? On December 7, 2017 at 07:56:00, Simon Elliston Ball ( si...@simonellistonball.com) wrote: I would recommend starting out with something like Spark, but the short answer is that anything that will run inside a

Re: machine learning libraries supported

2017-12-07 Thread Martin Andreoni
Hello Simon, thanks for the information. However, why do u affirm that the streaming models are not well suited? You could as some have suggested use spark streaming, but to be honest, the spark ML models are not well suited to streaming use cases Is there a performance problem or how would

Re: machine learning libraries supported

2017-12-07 Thread Simon Elliston Ball
I would recommend starting out with something like Spark, but the short answer is that anything that will run inside a yarn container, so the answer is most ML libraries. Using Spark to train models on the historical store is a good bet, and then using the trained models with model as a

Re: machine learning libraries supported

2017-12-07 Thread Otto Fowler
Right now, you can look at MaaS, for plugging in machine learning services. If you want to use spark, and you have it on your cluster, you could write your own spark drivers and have them pull from the kakfa topics ( indexing for example ) and run your spark stuff there. On December 7, 2017 at