Hi,

You could try MiniBatchKMeans. partial_fit takes an array rather than a generator, so loop over the generator and pass each chunk of features to partial_fit in turn.
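A minimal sketch of the idea, assuming the features can be produced one chunk at a time. The descriptor_batches generator is a hypothetical stand-in that yields random arrays; in practice it would yield the local descriptors extracted from a few images at a time. The cluster count and feature dimension are made-up values.

```python
import numpy as np
from sklearn.cluster import MiniBatchKMeans


def descriptor_batches(n_batches=20, batch_size=500, n_features=16, seed=0):
    """Hypothetical generator: yields one (batch_size, n_features) array at a time.

    Random data stands in for real image descriptors, so the full feature
    matrix never has to exist in memory.
    """
    rng = np.random.RandomState(seed)
    for _ in range(n_batches):
        yield rng.rand(batch_size, n_features)


n_clusters = 10  # codebook size, an arbitrary value for illustration
kmeans = MiniBatchKMeans(n_clusters=n_clusters, random_state=0)

# partial_fit expects an array, so consume the generator chunk by chunk.
# The first chunk must contain at least n_clusters samples, because the
# centers are initialised from it.
for batch in descriptor_batches():
    kmeans.partial_fit(batch)

# The learned cluster centers are the codebook (visual words).
codebook = kmeans.cluster_centers_

# Bag-of-words histogram for one (hypothetical) image's descriptors.
image_descriptors = np.random.RandomState(1).rand(200, 16)
words = kmeans.predict(image_descriptors)
histogram = np.bincount(words, minlength=n_clusters)
```

Only one chunk is held in memory at a time, so the chunk size can be tuned to whatever fits. The result is an approximation of full-batch k-means and depends on the order in which the chunks arrive.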
On Thu, Oct 30, 2014 at 4:37 PM, Dzung Nguyen <dzungn...@gmail.com> wrote:
> I'm running bag of words model on a big dataset of images. It generates a
> huge raw feature matrix ( before doing kmeans to get codebook) which can't
> be stored in memory. Can I use generator/yield in python and feed that to
> kmeans object?
>
> Thanks,
> --
> *Dzung Nguyen*
>
> PhD Student
> Electrical Engineering and Computer Science,
> Northwestern University, IL, USA
> http://users.eecs.northwestern.edu/~dtn419/

--
Godspeed,
Manoj Kumar,
Intern, Telecom ParisTech
Mech Undergrad
http://manojbits.wordpress.com
------------------------------------------------------------------------------
_______________________________________________
Scikit-learn-general mailing list
Scikit-learn-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general