Good point. On Wed, May 4, 2011 at 11:31 AM, Jake Mannix <[email protected]> wrote:
> On Wed, May 4, 2011 at 10:46 AM, Ted Dunning <[email protected]> > wrote: > > > Pipelining is good for abstraction and really bad for performance (in the > > map-reduce world). > > > > My thought is that we could have a multipurpose tool. Input would be a > > lucene index and the program would read term vectors or original text as > > available. Output would be either sequence file full of text or sequence > > file full of vectors. > > > > Ok, sure, then this is modifying the lucene.vectors code, not the > seq2sparse code, right? > > -jake >
