On Thu, Jan 19, 2012 at 7:44 AM, Gael Varoquaux <[email protected]> wrote: > On Wed, Jan 18, 2012 at 11:37:15PM +0100, Andreas wrote: >> Having this feature might get us a LOT of attention. >> But this is really not a simple project. > > Before trying to jump to the super fancy features, I'd rather have a > polished and versatile version of the scikit. They are many things that I > find that we haven't explored right. For instance these are my personal > pain points: > > * we don't have an online learning framework. > > * Our model selection framework is still weak > > - see > > https://github.com/scikit-learn/scikit-learn/pull/443#issuecomment-3231270 > > - also, it the difficulty to do nested cross-validation with a specific > cross-validation strategy, > > * we are light on the semi-supervised API > > * our parameter naming is not uniform-enough across models.
Also the scikit has a bias towards dense data. It would be nice if more estimators could work with sparse data too. Mathieu ------------------------------------------------------------------------------ Keep Your Developer Skills Current with LearnDevNow! The most comprehensive online learning library for Microsoft developers is just $99.99! Visual Studio, SharePoint, SQL - plus HTML5, CSS3, MVC3, Metro Style Apps, more. Free future releases when you subscribe now! http://p.sf.net/sfu/learndevnow-d2d _______________________________________________ Scikit-learn-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/scikit-learn-general
