Is maybe this contrib what you are looking for? Take a close look to see
whether it does what you expect.

http://contrib.scikit-learn.org/imbalanced-learn/auto_examples/over-sampling/plot_smote.html



On Tue, Jan 10, 2017 at 6:36 PM, Suranga Kasthurirathne <
suranga...@gmail.com> wrote:

>
> Hi all,
>
> I apologize - i've been looking for this answer all over the internet, and
> it could be that I'm not googling the right terms.
>
> For managing unbalanced datasets, Weka has SMOTE, and scikit has
> randomoversampling.
>
> In weka, we can ask it to boost by a given percentage (say 100%) so an
> undersampled class with 10 values ends up with 20 values (100% increase)
> after boosting.
>
> In Scikit learn, I cant seem to find a way to do this. The
> ramdomoversampler boosts arbitrarily. and seem to try to balance the two
> classes, which may not be realistic in some cases.
>
> Can anyone point me to how I can manage boosting percentage using scikit?
>
> --
> Best Regards,
> Suranga
>
> _______________________________________________
> scikit-learn mailing list
> scikit-learn@python.org
> https://mail.python.org/mailman/listinfo/scikit-learn
>
>
_______________________________________________
scikit-learn mailing list
scikit-learn@python.org
https://mail.python.org/mailman/listinfo/scikit-learn

Reply via email to