Hello everyone,

I am running an undersampling algorithm from imblearn, and at the back I want 
to train several gbms (catboost, xgb, lgbm, etc) with successivehalving to 
optimize the hyperparams.

Based on imblearn docs, I need to set up the undersampler within a pipeline, 
but that's innefficient, because in theory, I could undersample all 3 folds of 
the training data, store the left out folds with the original distribution, and 
then on those, train the various models with SH.

So my question is: how can I pass data folds (undersampled for training and 
original for the left out) to successive halving (or any hyperparam search for 
that matter)?

I posted a [question on 
stackoverflow](https://stackoverflow.com/questions/79748461/how-to-pass-pre-computed-folds-to-successivehalving-in-sklearn)
 with this issue. Sharing it here in case someone knows.

Thanks a lot!
Best
Sole

Soledad Galli
https://www.trainindata.com/
_______________________________________________
scikit-learn mailing list -- scikit-learn@python.org
To unsubscribe send an email to scikit-learn-le...@python.org
https://mail.python.org/mailman3//lists/scikit-learn.python.org
Member address: arch...@mail-archive.com

Reply via email to