[
https://issues.apache.org/jira/browse/SPARK-21209?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16312477#comment-16312477
]
Sandeep Kumar Choudhary commented on SPARK-21209:
-------------------------------------------------
I would like to work on it. I have used IPCA of python. I am reading few papers
to figure out the best possible solution. I will get to you on this in few days.
> Implement Incremental PCA algorithm for ML
> ------------------------------------------
>
> Key: SPARK-21209
> URL: https://issues.apache.org/jira/browse/SPARK-21209
> Project: Spark
> Issue Type: New Feature
> Components: ML
> Affects Versions: 2.1.1
> Reporter: Ben St. Clair
> Labels: features
>
> Incremental Principal Component Analysis is a method for calculating PCAs in
> an incremental fashion, allowing one to update an existing PCA model as new
> evidence arrives. Furthermore, an alpha parameter can be used to enable
> task-specific weighting of new and old evidence.
> This algorithm would be useful for streaming applications, where a fast and
> adaptive feature subspace calculation could be applied. Furthermore, it can
> be applied to combine PCAs from subcomponents of large datasets.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]