Document Classification with imbalanced data

[email protected] Wed, 03 Jul 2019 07:22:51 -0700

 I am trying document classification using OpenNLP however my data is highly 
unbalanced (majority class is 97%).  I recognize that I could randomly 
over/under sample the data set, and am reading up on SMOTE and ADASYN (not sure 
how to apply these to OpenNLP).  
Any suggestions on dealing with the highly unbalanced data would be appreciated.
Thanks
- viraf

Document Classification with imbalanced data

Reply via email to