It seems they have also started experimenting with streaming algorithms in
Spark and MLlib:
https://databricks.com/blog/2015/01/28/introducing-streaming-k-means-in-spark-1-2.html

I wonder how well their mini-batch programming model fits traditional
streaming algorithms.
Also, I suspect the concept of state carried across the stream does not fit
the RDD abstraction very well.

Interesting to read nevertheless.

Cheers,
--
Gianmarco
