On Sat, May 05, 2012 at 08:46:23PM -0400, David Warde-Farley wrote:
> So, one use case I've used to benchmark some of my own code is Coates et al's
> dataset of 400,000 CIFAR10 patches with 108 PCA-whitened dimensions

Sounds like an interesting dataset that could be added to the 'fetch_...'
functions. On IRC there is a parallel discussion about another large dataset:

http://www.di.unipi.it/~gulli/AG_corpus_of_news_articles.html

I wonder whether it would be worth having a wiki page on datasets as well.

G

_______________________________________________
Scikit-learn-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general
