So a little under a billion nonzero entries. A nice test, but not quite record breaking yet!
-jake On Feb 28, 2010 7:33 PM, "Robin Anil" <robin.a...@gmail.com> wrote: 12 GB uncompressed. I am uploading to s3 at the moment regex :) s3://mahout-wikipedia/unigram-tfidf-vectors/part-0000[0-9] On Mon, Mar 1, 2010 at 8:56 AM, Jake Mannix <jake.man...@gmail.com> wrote: > What's the final size...