2012/1/23 Gael Varoquaux <[email protected]>:
> On Mon, Jan 23, 2012 at 05:27:10PM +0100, Olivier Grisel wrote:
>> Alternatively we could make a vocabulary dict implementation
>> based on a redis server.
>
> That's two mails in a row suggesting to bind the scikit with an advanced
> persistence engine: first Dimitrios suggesting to persist to pytables,
> and then you suggesting redis.
>
> In general, I don't think that it is good to bind the scikit with a
> specific store: different stores will suit different people. What we
> could do is have a simple API that enables persistence. I would like to
> walk slowly in this direction, as it is quite easy to make overly complex
> and specific designs. I think that we need at least one fully working
> implementation to study before we can make decisions and start
> merging in the scikit.
I am not talking about adding a dependency on a redis client library to
scikit-learn, just about making it possible to pass a "vocabulary"
argument to the vectorizer that behaves like a Python defaultdict but
uses a redis server to host the data structure instead.

-- 
Olivier
http://twitter.com/ogrisel - http://github.com/ogrisel

_______________________________________________
Scikit-learn-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general
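
A minimal sketch of what such a "vocabulary" object could look like, under stated assumptions. It assumes the vectorizer only needs a mapping that hands out the next free integer index to an unseen term, as a defaultdict whose factory returns the current size would. The names HashBackedVocabulary and InMemoryHashClient are hypothetical and not part of scikit-learn. The in-memory client is only a stand-in so that the example runs without a server; a redis-py client exposes hget, hset and hlen with compatible call shapes and could be passed in instead.

```python
class InMemoryHashClient:
    """Hypothetical stand-in for a redis client (assumption, demo only).

    It implements just the three hash commands used below. A real
    redis-py client returns bytes unless created with
    decode_responses=True; int() accepts both forms.
    """

    def __init__(self):
        self._hashes = {}

    def hget(self, name, key):
        # Returns None when the field is absent, like redis HGET.
        return self._hashes.get(name, {}).get(key)

    def hset(self, name, key, value):
        self._hashes.setdefault(name, {})[key] = value

    def hlen(self, name):
        return len(self._hashes.get(name, {}))


class HashBackedVocabulary:
    """Term -> index mapping stored in one hash of an external store.

    Looking up an unseen term assigns it the next free index, which
    mimics a defaultdict whose default factory is its own length.
    """

    def __init__(self, client, name="vocabulary"):
        self.client = client
        self.name = name

    def __len__(self):
        return self.client.hlen(self.name)

    def __contains__(self, term):
        # Membership tests must not insert, as with defaultdict.
        return self.client.hget(self.name, term) is not None

    def __getitem__(self, term):
        raw = self.client.hget(self.name, term)
        if raw is not None:
            return int(raw)
        # Not atomic: concurrent writers would need a transaction or
        # a server-side script to avoid handing out duplicate indices.
        index = len(self)
        self.client.hset(self.name, term, index)
        return index


vocab = HashBackedVocabulary(InMemoryHashClient())
indices = [vocab[term] for term in ["the", "cat", "the", "sat"]]
```

The point of the design is that the store stays behind a small dict-like interface, so scikit-learn itself would not need to import any client library.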
