Hi Dmitriy,

  Just a few comments:

--the computed factors are approximate  A \approx U\SigmaV^{T}

-- the projection steps seemed transposed to me but they are consistent
throughout ie.
(2)  \tilde{u} = \tilde{c}_{r} V \Sigma^{-1}

p. 3:  transpose \xi to emphasize row vector

- 'mean of all rows' is a bit misleading, \xi entries are the mean of each
column  (column-wise mean as you state below)

- dimention -> dimension

I haven't code dived into the new pca code to be familiar with it so the
above comments are just picky notational stuff.  I did however, do some
extensive analysis on the standard decomposition part (as of 0.6 SNAPSHOT)
which can be found here

http://amath.colorado.edu/faculty/martinss/Pubs/2012_halko_dissertation.pdf
(starting page 139)

Its a beast of a read so I can concisely discuss a few points perhaps on a
different thread.



On Wed, Feb 22, 2012 at 6:08 PM, Dmitriy Lyubimov <dlie...@gmail.com> wrote:

> Could somebody please review SSVD command line usage doc before i
> update it on wiki for inclusion of complete nonsense, in particular,
> section $3 where it does overview of PCA and dimensionality reduction
> techniques? Here's the SSVD CLI doc:
>
>
> https://github.com/dlyubimov/mahout-commits/blob/ssvd-docs/SSVD-CLI.pdf?raw=true
>
>
> Thanks.
> -D
>
> On Wed, Feb 22, 2012 at 4:53 PM, Dmitriy Lyubimov <dlie...@gmail.com>
> wrote:
> > Hi,
> > working on PCA section in SSVD usage .
> >
> > Just to confirm, if we run and svd over input with mean subtracted,
> > then U matrix presents original data points converted to PCA space,
> > right?
> >
> > thanks.
> > -d
>

Reply via email to