I noticed that the C-LAPACK svd function is very slow and memory
inefficient on my (Linux) laptop. The native J version in math/misc/svd
is actually many times faster and more memory efficient, so I used
math/misc/svd instead. Now, this flags a bug in C-LAPACK for certain,
as it isn't nearly so bad when I do the same thing in R, but it also
shows the amazing power of J.
As an addendum, I noticed that R no longer uses dgesvd; it uses dgesdd
instead, which I guess is more efficient. Also, both gesvd and the
native svd technique produce the same answer.
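As a cross-check outside J (my own illustration, not the original session): SciPy exposes both LAPACK drivers through the `lapack_driver` parameter of `scipy.linalg.svd`, so the "same answer" claim is easy to confirm there too.

```python
# Compare the singular values produced by the two LAPACK SVD drivers,
# gesvd (the classic routine) and gesdd (the divide-and-conquer one
# that R and NumPy use by default).
import numpy as np
from scipy.linalg import svd

rng = np.random.default_rng(0)
a = rng.random((200, 30))  # a tall-thin matrix, like the one in the post

s_gesvd = svd(a, compute_uv=False, lapack_driver='gesvd')
s_gesdd = svd(a, compute_uv=False, lapack_driver='gesdd')

# The two drivers agree to machine precision.
print(np.allclose(s_gesvd, s_gesdd))
```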
trn=. 250000 30 $?.1e6#0
load'math/misc/svd'
load'math/lapack'
load'math/lapack/gesvd'
ts=: 6!:2, 7!:2@]
b=.ts 'q2=:svd 10000{.trn'
a=.ts 'q=:gesvd_jlapack_ 10000{.trn'
a%b
29.1044 171.825
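I don't know what the jlapack gesvd binding computes internally, but a generic way an SVD on a tall-thin matrix blows up in memory is by forming the full m-by-m U rather than the economy m-by-n U. A NumPy sketch of the difference (my illustration, with a smaller matrix; nothing here is the J addon's code):

```python
# Full vs. thin (economy) SVD of a tall-thin matrix: the full U is
# m x m, the thin U only m x n, which for 2000 x 30 is a ~67x
# difference in storage for the left singular vectors alone.
import numpy as np

a = np.random.default_rng(1).random((2000, 30))

u_full, s_full, vt_full = np.linalg.svd(a, full_matrices=True)
u_thin, s_thin, vt_thin = np.linalg.svd(a, full_matrices=False)

print(u_full.shape, u_thin.shape)  # (2000, 2000) (2000, 30)
```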
If you want to see what I am using SVD for, I've been fiddling with
matrix approximants, CUR decomposition in particular:
https://github.com/locklin/jCUR
CUR decomposition is a 2009 technique for efficiently approximating
matrices by selecting quasi-random pieces of the original matrix. It
has utility in dimensionality reduction in the same spirit as PCA, but
it is more interpretable, since you have original rows and columns of
the matrix. Such things might eventually be incorporated into Jd as a
way of making sense of large amounts of data. There are other such
techniques I plan on looking at eventually, but I want to find a good
use case for CUR first (probably something in portfolio theory).
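For the curious, the basic shape of a CUR approximation fits in a few lines. This is a minimal sketch of my own (uniform row/column sampling with a pseudoinverse core, rather than the leverage-score sampling of the 2009 paper, and not the code in jCUR):

```python
# Minimal CUR sketch: pick actual columns C and rows R of A, then set
# U = pinv(C) @ A @ pinv(R), so C @ U @ R approximates A. Because C and
# R are literal slices of A, they stay interpretable, unlike PCA loadings.
import numpy as np

rng = np.random.default_rng(2)

# A low-rank test matrix, so a modest sample recovers it almost exactly.
a = rng.random((100, 8)) @ rng.random((8, 60))

k = 20  # number of sampled columns and rows (assumption for this demo)
cols = rng.choice(a.shape[1], size=k, replace=False)
rows = rng.choice(a.shape[0], size=k, replace=False)

c = a[:, cols]                                   # C: real columns of A
r = a[rows, :]                                   # R: real rows of A
u = np.linalg.pinv(c) @ a @ np.linalg.pinv(r)    # core matrix U

approx = c @ u @ r
print(np.linalg.norm(a - approx) / np.linalg.norm(a))  # tiny for rank-8 A
```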
-SL
----------------------------------------------------------------------
For information about J forums see http://www.jsoftware.com/forums.htm