Hi Matthias.
As far as I know, the main goal for TPE was to support tree-structured
parameter spaces. I am not sure we want to go there yet because of the
more complex API.
On non-tree-structured spaces, I think TPE performed worse than SMAC and GP.
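Just to illustrate what I mean by tree-structured: in hyperopt, the choice
of model determines which hyperparameters get sampled at all. A toy sketch
(the space and objective below are made up for illustration; only the
hp / fmin / tpe API is hyperopt's):

from hyperopt import fmin, tpe, hp

# The classifier choice conditions which hyperparameters are sampled.
space = hp.choice('classifier', [
    {'type': 'svm',
     'C': hp.loguniform('svm_C', -5, 5),
     'kernel': hp.choice('svm_kernel', ['linear', 'rbf'])},
    {'type': 'random_forest',
     'max_depth': hp.quniform('rf_max_depth', 1, 10, 1)},
])

def objective(params):
    # Plug in the cross-validated error of the chosen model here.
    return 0.0

best = fmin(objective, space, algo=tpe.suggest, max_evals=50)
print(best)

Supporting this kind of conditional space is exactly what would make the
API more complex on our side.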
With regard to your code: There might be touchy legal issues involved if
you don't publish your code and we base our implementation on it.
If your code is public and BSD / MIT licensed, it would probably be much
safer. Why don't you just push your code under a permissive license?
Thank you for providing your benchmarks; they might be quite helpful.
Cheers,
Andy
On 03/26/2015 11:17 AM, Matthias Feurer wrote:
Dear Christof, dear scikit-learn team,
This is a great idea; I highly encourage integrating Bayesian
optimization into scikit-learn, since automatically configuring
scikit-learn is quite powerful. It was done by the three
winning teams of the first automated machine learning competition:
https://sites.google.com/a/chalearn.org/automl/
I am writing this e-mail because our research group on learning,
optimization and automated algorithm design
(http://aad.informatik.uni-freiburg.de/) is working on very similar
things which might be useful in this context. Some people in our lab
(together with some people from other universities) developed a
framework for robust Bayesian optimization with minimal external
dependencies. It currently depends on GPy, but this dependency could
be easily replaced by the scikit-learn GP. It is probably not as
lightweight as you would want it for scikit-learn, but you might
want to have a look at the source code. I will provide a link as soon
as the project is public (which is soon). In the meantime, I can grant
read-access to those who are interested. It might be helpful for you
to have a look at the structure of the module.
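To give a rough idea of the kind of loop such a module wraps, here is a
toy sketch I put together for this mail (it is not our framework's code;
it assumes scikit-learn's GaussianProcessRegressor as the surrogate and
expected improvement as the acquisition function, on a made-up 1-D
objective):

import numpy as np
from scipy.stats import norm
from sklearn.gaussian_process import GaussianProcessRegressor

def expected_improvement(gp, X_cand, y_best):
    # EI for minimization: expected gain over the best value seen so far.
    mu, sigma = gp.predict(X_cand, return_std=True)
    sigma = np.maximum(sigma, 1e-9)
    z = (y_best - mu) / sigma
    return (y_best - mu) * norm.cdf(z) + sigma * norm.pdf(z)

def objective(x):
    # Stand-in for the cross-validated error at hyperparameter value x.
    return float((x - 0.3) ** 2)

rng = np.random.RandomState(0)
X = rng.uniform(0.0, 1.0, size=(3, 1))      # a few random starting points
y = np.array([objective(x[0]) for x in X])

for _ in range(10):
    gp = GaussianProcessRegressor(normalize_y=True).fit(X, y)
    cand = rng.uniform(0.0, 1.0, size=(500, 1))
    x_next = cand[np.argmax(expected_improvement(gp, cand, y.min()))]
    X = np.vstack([X, x_next])
    y = np.append(y, objective(x_next[0]))

print("best x:", X[np.argmin(y)][0], "best value:", y.min())

A real implementation has to optimize the acquisition function properly
and handle noisy observations, but the loop structure is the same.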
Besides these remarks, I think that using a GP is a good way to tune
the few hyperparameters of a single model. Another remark: Instead of
comparing GPSearchCV only to Spearmint, you should also consider the
TPE algorithm implemented in hyperopt
(https://github.com/hyperopt/hyperopt). You could consider the
following benchmarks:
1. Together with a fellow student I implemented a library called
HPOlib, which provides a few benchmarks for hyperparameter
optimization (for example some from the 2012 Spearmint paper):
https://github.com/automl/HPOlib
It is further described in this
paper: http://automl.org/papers/13-BayesOpt_EmpiricalFoundation.pdf
2. If you are looking for a small pipeline, you can use
sklearn.feature_selection.SelectPercentile with a fixed scoring
function together with a classification algorithm. It adds a single
hyperparameter which should be a good fit for the GP.
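For illustration, a rough sketch of what I mean (the digits data, the SVC
and the plain grid search are only placeholders; GPSearchCV would take
the place of GridSearchCV and tune the percentile):

from sklearn.datasets import load_digits
from sklearn.feature_selection import SelectPercentile, f_classif
from sklearn.pipeline import Pipeline
from sklearn.svm import SVC
from sklearn.model_selection import GridSearchCV

X, y = load_digits(return_X_y=True)

# Fixed scoring function (f_classif); the percentile of features to keep
# is the single extra hyperparameter introduced by the pipeline.
pipe = Pipeline([
    ("select", SelectPercentile(score_func=f_classif)),
    ("clf", SVC()),
])

param_grid = {"select__percentile": [5, 10, 25, 50, 75, 100]}
search = GridSearchCV(pipe, param_grid, cv=3)
search.fit(X, y)
print(search.best_params_, search.best_score_)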
Best regards,
Matthias