j143 opened a new pull request #1264: URL: https://github.com/apache/systemds/pull/1264
Background: Due to the accuracy oriented design of the classifiers the performance is hindered with imbalance, which usually overlooks minority class. "Any dataset with an unequal class distribution is technically imbalanced... However a significant ... disproportion among the number of examples of each class of the problem." Page 19, section 2.1 source: https://www.springer.com/gp/book/9783319980737 Similar function implemented elsewhere: Scikit learn - http://glemaitre.github.io/imbalanced-learn/generated/imblearn.under_sampling.TomekLinks.html R docs - https://www.rdocumentation.org/packages/UBL/versions/0.0.6/topics/TomekClassif -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected]
