There is still a libimseti dataset http://www.occamslab.com/petricek/datawith 17,359,346 ratings. People are scared after the Netflix lawsuit.
On Thu, Jul 7, 2011 at 10:17 PM, Ted Dunning <[email protected]> wrote: > Those are both reasonably large, but not commercial in scale. > > At Veoh, we had about 10 non-zero elements in our raw data. I think > Netflix > has 100 million. > > On Thu, Jul 7, 2011 at 8:05 PM, Lance Norskog <[email protected]> wrote: > > > What recommendation datasets, that are available, are considered > > "large" by Mahout testing standards? Yahoo KDD Cup is offline, the > > Netflix data went under a cloud... > > > > -- > > Lance Norskog > > [email protected] > > >
