Re: LDA from Lucene Indexes

Jake Mannix Wed, 04 May 2011 11:32:32 -0700

On Wed, May 4, 2011 at 10:46 AM, Ted Dunning <[email protected]> wrote:


> Pipelining is good for abstraction and really bad for performance (in the
> map-reduce world).
>
> My thought is that we could have a multipurpose tool.  Input would be a
> lucene index and the program would read term vectors or original text as
> available.  Output would be either sequence file full of text or sequence
> file full of vectors.
>

Ok, sure, then this is modifying the lucene.vectors code, not the
seq2sparse code, right?

  -jake

Re: LDA from Lucene Indexes

Reply via email to