On Sat, Nov 20, 2010 at 8:47 AM, Mike Perry <[email protected]>wrote:
> Hello all, > > Does the script to convert a Lucene index to Mahout vectors write sequence > files in sparse vector representation? my impression is that it doesn't but > I want to verify that. > Should be sparse, but I can't say for sure. > Also, SparseVectorsFromSequenceFiles is used to convert the vectors to > sparse format (I know about the seq2sparse option). Could someone point out > where in the code it actually constructs the sparse vectors? it seems to > me > that one of the methods in DictionaryVectorizer generates the vectors but I > couldn't > find where exactly. > Look for VectorWritable.
