Hi, I tried a few things on that in my last blog post on : http://mandubian.com/2014/03/10/zpark-ml-nio-3/ (last part of a tryptic about spark & scalaz-stream)
I built a collaborative filtering and then use it on each RDD of the DStream usingn a transform { rdd => model.predict(rdd)... }. It works but I need to investigate what happens with the model being potentially remoted... Not sure it's good (or not) Pascal On Thu, Mar 20, 2014 at 2:03 AM, Tathagata Das <tathagata.das1...@gmail.com>wrote: > Yes, of course you can conceptually apply machine learning algorithm on > Spark Streaming. However the current MLLib does not yet have direct support > for Spark Streaming's DStream. However, since DStreams are essentially a > sequence of RDDs, you can apply MLLib algorithms on those RDDs. Take a look > at DStream.transform() and DStream.foreachRDD() operations, which allows > you access RDDs in a DStream. You can apply MLLib functions on them. > > Some people have attempted to make a tighter integration between MLLib and > Spark Streaming. Jeremy (cc'ed) can say more about his adventures. > > TD > > > On Sun, Mar 16, 2014 at 5:56 PM, Nasir Khan <nasirkhan.onl...@gmail.com>wrote: > >> hi, I m into a project in which i have to get streaming URL's and Filter >> it >> and classify it as benin or suspicious. Now Machine Learning and Streaming >> are two separate things in apache spark (AFAIK). my Question is Can we >> apply >> Online Machine Learning Algorithms on Streams?? >> >> I am at Beginner Level, Kindly Explain in abit detail and if some one can >> direct me to some good material for me will be greats..... >> >> Thanks >> Nasir Khan. >> >> >> >> -- >> View this message in context: >> http://apache-spark-user-list.1001560.n3.nabble.com/Machine-Learning-on-streaming-data-tp2732.html >> Sent from the Apache Spark User List mailing list archive at Nabble.com. >> > >