On Tue, Jul 6, 2010 at 9:15 PM, Grant Ingersoll <[email protected]> wrote:
> > On Jul 6, 2010, at 12:46 PM, Ted Dunning wrote: > > > Computing 1000 singular vectors is generally neither necessary nor > helpful. > > OK, good to know. This is my first time ever running SVD, so I have no > clue what a useful number is for the rank value. Advice welcome here. My rule of thumb has been that for text type stuff (i.e. LSI/LSA), something around 200-400 is the most you'll ever need. For smaller corpora and/or vocabularies, even below the bottom end of this range is fine. -jake
