Hi Fernando,

There’s currently no streaming ALS in Spark. I’m exploring a streaming singular 
value decomposition (JIRA) based on this paper 
(http://www.stat.osu.edu/~dmsl/thinSVDtracking.pdf), which might be one way to 
think about it.

There has also been some cool recent work explicitly on streaming ALS w/ SGD 
that we should look into 
(https://www.cs.utexas.edu/~cjohnson/ParallelCollabFilt.pdf).

— Jeremy

-------------------------
jeremyfreeman.net
@thefreemanlab

On Dec 23, 2014, at 2:47 PM, Fernando O. <fot...@gmail.com> wrote:

> Hey Xiangrui,
>   
>     Is there any plan to have a streaming compatible ALS version?
> 
> Or if it's currently doable, is there any example?
> 
> 
> 
> On Tue, Dec 23, 2014 at 4:31 PM, Xiangrui Meng <men...@gmail.com> wrote:
> We have streaming linear regression (since v1.1) and k-means (v1.2) in
> MLlib. You can check the user guide:
> 
> http://spark.apache.org/docs/latest/mllib-linear-methods.html#streaming-linear-regression
> http://spark.apache.org/docs/latest/mllib-clustering.html#streaming-clustering
> 
> -Xiangrui
> 
> On Tue, Dec 23, 2014 at 10:01 AM, Gianmarco De Francisci Morales
> <g...@apache.org> wrote:
> > Hi,
> >
> > I have recently seen a demo of Spark where different pieces were put
> > together (training via MLlib + deploying on Spark Streaming).
> > I was wondering if MLlib currently works to directly train on Streaming.
> > And, if so, what are the semantics of the algorithms?
> > If not, would it be interesting to have ML algorithms developed for the
> > streaming setting?
> >
> > Thanks,
> > --
> > Gianmarco
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: user-unsubscr...@spark.apache.org
> For additional commands, e-mail: user-h...@spark.apache.org
> 
> 

Reply via email to