Hi,

I have been working recently with the factor analysis implementation in 
scikit-learn and checked it against its main reference (algorithm 21 in David 
Barber’s book). I noticed there is a difference from the algorithm described in 
the book in the way the SVD results are used. In the book, SVD is performed for 
the scaled data matrix like this: X = USV’, and then the first N columns in U 
and first N elements in S are used to compute the factors in each iteration (N 
being the number of components in the model). However, in scikit-learn code, 
the first N rows in V’ (or first N columns in V) are used instead. Is there any 
specific reason for using V instead of U in this case?

Best regards,
João
------------------------------------------------------------------------------
Download BIRT iHub F-Type - The Free Enterprise-Grade BIRT Server
from Actuate! Instantly Supercharge Your Business Reports and Dashboards
with Interactivity, Sharing, Native Excel Exports, App Integration & more
Get technology previously reserved for billion-dollar corporations, FREE
http://pubads.g.doubleclick.net/gampad/clk?id=164703151&iu=/4140/ostg.clktrk
_______________________________________________
Scikit-learn-general mailing list
Scikit-learn-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general

Reply via email to