2012/7/5 Peter Prettenhofer <[email protected]>: > 2012/7/5 Olivier Grisel <[email protected]>: >> 2012/7/5 Emanuele Olivetti <[email protected]>: >>> On 07/05/2012 08:49 AM, Olivier Grisel wrote: >>>> 2012/7/5 Peter Prettenhofer <[email protected]>: >>>>> ... >>>>> >>>>> I've to check with the competition organizers whether its ok to put >>>>> the source code on github - I'll keep you posted. >>>> If so that would be a great blog post topic. Looking forward to it. >>>> >>> >>> Hi, >>> >>> For what it's worth, I've put the code of my best submission on >>> github: >>> https://github.com/emanuele/kaggle_ops >>> http://www.kaggle.com/c/online-sales/forums/t/2136/the-code-of-my-best-submission >>> > > Emanuele, > > thanks a lot for sharing - that's great! > >> >> Thank you very much Emanuele, the blending code is very useful. >> >> You should blog it IMHO by explaining the various code snippets: >> >> - feature extraction / expansions (e.g. how to handle dates & times as >> features) >> - your visual exploration of which feature to convert to the log scale >> - dealing with missing values >> - blending the outcome of randomized models >> - cross validation and performance evaluation in general (did you do >> any error analysis, e.g. bias and variance using learning curves?) > > Indeed would be great to have a component to generate learning curves > in sklearn - I have some custom code lying around but it's rather > ugly...
I have this gist: https://gist.github.com/2972039 >> It would be great to turn it the blending procedure as either an >> example for scikit-learn (using one of the default toy datasets) or a >> new meta-estimator in a new package (more work required but would >> improve re-usability). > > I totally agree - a stacking estimator would be great too - in my > experience getting stacking right is far more difficult than one would > expect in the first place. +1 -- Olivier http://twitter.com/ogrisel - http://github.com/ogrisel ------------------------------------------------------------------------------ Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ _______________________________________________ Scikit-learn-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/scikit-learn-general
