On Mon, Jan 23, 2012 at 7:24 PM, Olivier Grisel <[email protected]> wrote:
> Have a look at `sklearn.linear_model.SGDClassifier`, which supports a
> `partial_fit` method in master that you can call several times with
> slices of data.
>
> BTW: what is the structure of your data in PyTables? Is it mapped to a
> scipy.sparse Compressed Sparse Row data structure? How many features do
> you have in your dataset?

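For reference, here is a minimal sketch of the `partial_fit` pattern quoted above: the model is updated one row slice at a time. The synthetic CSR matrix, the labels, and `chunk_size` are assumptions made up for illustration. In a real setting each slice would be read from disk (e.g. from PyTables) rather than cut from an in-memory matrix.

```python
import numpy as np
import scipy.sparse as sp
from sklearn.linear_model import SGDClassifier

# Synthetic stand-in for a large sparse dataset (assumption, not real data).
rng = np.random.RandomState(0)
n_samples, n_features = 1000, 50
X = sp.random(n_samples, n_features, density=0.2, format="csr",
              random_state=rng)

# Hypothetical labels generated from a random linear rule.
w_true = rng.randn(n_features)
y = (X.dot(w_true) > 0).astype(int)

clf = SGDClassifier(random_state=0)

# partial_fit needs the full set of labels up front, because a single
# slice is not guaranteed to contain every class.
classes = np.array([0, 1])

chunk_size = 100  # hypothetical slice size; tune to available memory
for start in range(0, n_samples, chunk_size):
    stop = min(start + chunk_size, n_samples)
    # Row-slicing a CSR matrix is cheap and yields another CSR matrix.
    clf.partial_fit(X[start:stop], y[start:stop], classes=classes)

# Scored on the training data only, as a sanity check.
accuracy = clf.score(X, y)
```

This makes a single pass over the data. Looping over the slices several times (ideally in a shuffled order) usually helps SGD converge further.
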
Olivier, it would be nice if you could create a large-scale sparse example using NumPy's memory-mapped arrays. If I remember correctly, you mentioned those in the past, but I never saw them actually used in combination with scikit-learn.

Mathieu

_______________________________________________
Scikit-learn-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general
