On Tue, Jan 24, 2012 at 1:27 AM, Olivier Grisel
<[email protected]> wrote:

> I agree although this would be really useful once I am done with the
> hashing text vectorizer. Otherwise the vocabulary dict will explode in
> memory.

Indeed, combined with your hashing text vectorizer, this will allow to
cache the extracted features and thus make several epochs over the
dataset (each epoch being broken down into several calls to
partial_fit).

Mathieu

------------------------------------------------------------------------------
Try before you buy = See our experts in action!
The most comprehensive online learning library for Microsoft developers
is just $99.99! Visual Studio, SharePoint, SQL - plus HTML5, CSS3, MVC3,
Metro Style Apps, more. Free future releases when you subscribe now!
http://p.sf.net/sfu/learndevnow-dev2
_______________________________________________
Scikit-learn-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general

Reply via email to