Hi all, I apologize - i've been looking for this answer all over the internet, and it could be that I'm not googling the right terms.
For managing unbalanced datasets, Weka has SMOTE, and scikit has randomoversampling. In weka, we can ask it to boost by a given percentage (say 100%) so an undersampled class with 10 values ends up with 20 values (100% increase) after boosting. In Scikit learn, I cant seem to find a way to do this. The ramdomoversampler boosts arbitrarily. and seem to try to balance the two classes, which may not be realistic in some cases. Can anyone point me to how I can manage boosting percentage using scikit? -- Best Regards, Suranga
_______________________________________________ scikit-learn mailing list scikit-learn@python.org https://mail.python.org/mailman/listinfo/scikit-learn