GitHub user Swatisoni opened a pull request:
https://github.com/apache/madlib/pull/218
Balanced Datasets: Random undersampling with/without replacement
JIRA:MADLIB-1168
Additional Authors:
Orhan Kislal <[email protected]>
This commit implements random undersampling to create a dataset
with balanced classes.
Both with- and without-replacement methods are available.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/Swatisoni/madlib feature/balanced_sets
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/madlib/pull/218.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #218
----
commit 010199cbd2d14f13eca76330d54fc6e29fb9ecee
Author: Swatisoni <[email protected]>
Date: 2017-12-18T23:30:41Z
Balanced Datasets: Random undersampling with/without replacement
JIRA:MADLIB-1168
Additional Authors:
Orhan Kislal <[email protected]>
This commit implements random undersampling to create a dataset
with balanced classes.
Both with- and without-replacement methods are available.
----
---