I am trying document classification using OpenNLP however my data is highly 
unbalanced (majority class is 97%).  I recognize that I could randomly 
over/under sample the data set, and am reading up on SMOTE and ADASYN (not sure 
how to apply these to OpenNLP).  
Any suggestions on dealing with the highly unbalanced data would be appreciated.
Thanks
- viraf

Reply via email to