[ 
https://issues.apache.org/jira/browse/MAHOUT-817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13158196#comment-13158196
 ] 

Dmitriy Lyubimov edited comment on MAHOUT-817 at 11/28/11 5:52 AM:
-------------------------------------------------------------------

Computation of m*omega also may be fairly involved because even that is vector 
matrix multiplication, Omega is dense, bigger than input, even though we don't 
have to move its input around. Maybe for big inputs we can just take a math 
expectation in of this. For the uniform distribution of murmur(is it uniform?) 
we perhaps can ignore the whole m x Omega because it converges on 0 per law of 
big numbers.
                
      was (Author: dlyubimov):
    Computation of m*omega also may be fairly involved because even that is 
vector matrix multiplication, Omega is dense, bigger than input, even though we 
don't have to move its input around. Maybe for big inputs we can just take a 
math expectation in of this. For the uniform distribution of murmur(is it 
uniform?) We perhaps can I ore the whole m x Omega because it converges on 0 
per law of big numbers.
                  
> Add PCA options to SSVD code
> ----------------------------
>
>                 Key: MAHOUT-817
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-817
>             Project: Mahout
>          Issue Type: New Feature
>    Affects Versions: 0.6
>            Reporter: Dmitriy Lyubimov
>            Assignee: Dmitriy Lyubimov
>             Fix For: Backlog
>
>
> It seems that a simple solution should exist to integrate PCA mean 
> subtraction into SSVD algorithm without making it a pre-requisite step and 
> also avoiding densifying the big input. 
> Several approaches were suggested:
> 1) subtract mean off B
> 2) propagate mean vector deeper into algorithm algebraically where the data 
> is already collapsed to smaller matrices
> 3) --?
> It needs some math done first . I'll take a stab at 1 and 2 but thoughts and 
> math are welcome.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to