Is this contrib package perhaps what you are looking for? Take a close look at the example to check whether it does what you expect.
http://contrib.scikit-learn.org/imbalanced-learn/auto_examples/over-sampling/plot_smote.html

On Tue, Jan 10, 2017 at 6:36 PM, Suranga Kasthurirathne <suranga...@gmail.com> wrote:

>
> Hi all,
>
> I apologize - I've been looking for this answer all over the internet, and
> it could be that I'm not googling the right terms.
>
> For managing unbalanced datasets, Weka has SMOTE, and scikit has
> randomoversampling.
>
> In Weka, we can ask it to boost by a given percentage (say 100%) so an
> undersampled class with 10 values ends up with 20 values (100% increase)
> after boosting.
>
> In scikit-learn, I can't seem to find a way to do this. The
> RandomOverSampler boosts arbitrarily and seems to try to balance the two
> classes, which may not be realistic in some cases.
>
> Can anyone point me to how I can manage boosting percentage using scikit?
>
> --
> Best Regards,
> Suranga
>
> _______________________________________________
> scikit-learn mailing list
> scikit-learn@python.org
> https://mail.python.org/mailman/listinfo/scikit-learn
>
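For reference, here is a minimal sketch of the percentage-based boosting described in the question, written with the standard library only. The helper name `oversample_by_percent` and its parameters are hypothetical and are not part of scikit-learn or imbalanced-learn. It duplicates randomly chosen rows of one class (plain random oversampling with replacement), so unlike SMOTE it does not synthesize new points.

```python
import random
from collections import Counter


def oversample_by_percent(X, y, label, percent, seed=0):
    """Grow class `label` by `percent` % by duplicating randomly chosen rows.

    Hypothetical helper for illustration; not a library function.
    """
    rng = random.Random(seed)
    # Positions of the rows that belong to the class we want to boost.
    idx = [i for i, target in enumerate(y) if target == label]
    if not idx:
        raise ValueError("no samples found for label %r" % (label,))
    # 100 % on a class of 10 rows means 10 extra rows.
    n_extra = round(len(idx) * percent / 100)
    # Sample with replacement from the existing rows of that class.
    extra = [rng.choice(idx) for _ in range(n_extra)]
    X_out = list(X) + [X[i] for i in extra]
    y_out = list(y) + [y[i] for i in extra]
    return X_out, y_out


# Toy data: 10 minority rows (class 1) and 100 majority rows (class 0).
X = [[i] for i in range(110)]
y = [1] * 10 + [0] * 100

X_res, y_res = oversample_by_percent(X, y, label=1, percent=100)
counts = Counter(y_res)
print(counts)
```

If you would rather stay inside imbalanced-learn, recent releases document a dict form for the `sampling_strategy` argument of `RandomOverSampler` and `SMOTE` that maps a class label to the desired number of samples (older releases called the argument `ratio`). Please check this against the version you have installed.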