Re: [Scikit-learn-general] Scikit-learn for large datasets?

2013-08-23 Thread helge.reike...@gmail.com
y NMF on the data. Do you think the scikit-learn implementation could work with 100,000 sparse features on 1 billion rows? Regards, Helge On Fri, Aug 23, 2013 at 12:37 PM, Olivier Grisel wrote: > 2013/8/23 [email protected] : > > Good day, > > > > Can anyone perhaps gi

[Scikit-learn-general] Scikit-learn for large datasets?

2013-08-23 Thread helge.reike...@gmail.com
Good day, Can anyone perhaps give me an idea of how large datasets scikit-learn algorithms typically can handle? I have about 4 TB of structured data. I might be able to normalize that down to say 1 TB if necessary. The tasks would typically be logistic regression, Naive Bayes, k-Means and possib