Good day,
Can anyone perhaps give me an idea of how large datasets scikit-learn
algorithms typically can handle?
I have about 4 TB of structured data. I might be able to normalize that
down to say 1 TB if necessary. The tasks would typically be logistic
regression, Naive Bayes, k-Means and possib
y NMF on the data. Do you think the scikit-learn
implementation could work with 100,000 sparse features on 1 billion rows?
Regards,
Helge
On Fri, Aug 23, 2013 at 12:37 PM, Olivier Grisel
wrote:
> 2013/8/23 [email protected] :
> > Good day,
> >
> > Can anyone perhaps gi