Those are both reasonably large, but not commercial in scale. At Veoh, we had about 10 non-zero elements in our raw data. I think Netflix has 100 million.
On Thu, Jul 7, 2011 at 8:05 PM, Lance Norskog <[email protected]> wrote: > What recommendation datasets, that are available, are considered > "large" by Mahout testing standards? Yahoo KDD Cup is offline, the > Netflix data went under a cloud... > > -- > Lance Norskog > [email protected] >
