On Tue, Jul 6, 2010 at 9:15 PM, Grant Ingersoll <[email protected]> wrote:

>
> On Jul 6, 2010, at 12:46 PM, Ted Dunning wrote:
>
> > Computing 1000 singular vectors is generally neither necessary nor
> helpful.
>
> OK, good to know.  This is my first time ever running SVD, so I have no
> clue what a useful number is for the rank value.  Advice welcome here.


My rule of thumb has been that for text-type stuff (i.e. LSI/LSA), a rank of
around 200-400 is the most you'll ever need.  For smaller corpora and/or
vocabularies, even a rank below the bottom end of this range is fine.
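
To make that concrete, here is a rough sketch in plain Python (not Mahout
code).  The helper name clamp_rank and the default cap of 400 are made up for
illustration; the cap is just the top of the range above, and the only hard
fact used is that a matrix has no more singular vectors than its smaller
dimension.

```python
def clamp_rank(requested, n_docs, n_terms, cap=400):
    """Hypothetical helper: trim a requested SVD rank to a sensible value.

    `cap` is an assumption taken from the 200-400 rule of thumb for
    LSI/LSA; it is not a Mahout default.
    """
    if requested < 1 or n_docs < 1 or n_terms < 1:
        raise ValueError("rank and matrix dimensions must be positive")
    # A docs-by-terms matrix has at most min(n_docs, n_terms) singular vectors.
    max_rank = min(n_docs, n_terms)
    # Never ask for more than the rule-of-thumb cap or the matrix can supply.
    return min(requested, cap, max_rank)


# Asking for 1000 vectors on a large corpus gets trimmed to the cap.
print(clamp_rank(1000, n_docs=50000, n_terms=20000))  # prints 400
# A small corpus limits the rank further, below the bottom of the range.
print(clamp_rank(1000, n_docs=150, n_terms=20000))    # prints 150
```

Treat the cap as a starting point to tune against your own data rather than a
fixed answer.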

  -jake
