[
https://issues.apache.org/jira/browse/SPARK-4981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14299666#comment-14299666
]
Reza Zadeh commented on SPARK-4981:
-----------------------------------
To be model parallel, we can simply warm-start the current ALS implementation
in org.apache.spark.mllib.recommendation
The work involved would be to expose a warm-start option in ALS, and then redo
training with say 2 iterations instead of 10, with each batch of RDDs.
The stream would be over batches of Ratings.
This should be the simplest option.
> Add a streaming singular value decomposition
> --------------------------------------------
>
> Key: SPARK-4981
> URL: https://issues.apache.org/jira/browse/SPARK-4981
> Project: Spark
> Issue Type: New Feature
> Components: MLlib, Streaming
> Reporter: Jeremy Freeman
>
> This is for tracking WIP on a streaming singular value decomposition
> implementation. This will likely be more complex than the existing streaming
> algorithms (k-means, regression), but should be possible using the family of
> sequential update rule outlined in this paper:
> "Fast low-rank modifications of the thin singular value decomposition"
> by Matthew Brand
> http://www.stat.osu.edu/~dmsl/thinSVDtracking.pdf
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]