I am trying document classification using OpenNLP however my data is highly unbalanced (majority class is 97%). I recognize that I could randomly over/under sample the data set, and am reading up on SMOTE and ADASYN (not sure how to apply these to OpenNLP). Any suggestions on dealing with the highly unbalanced data would be appreciated. Thanks - viraf
Document Classification with imbalanced data
viraf.bankwa...@yahoo.com.INVALID Wed, 03 Jul 2019 07:22:51 -0700
- TokenNameFinder Viraf Bankwalla
- Re: TokenNameFinder Rodrigo Agerri
- Re: TokenNameFinder Viraf Bankwalla
- Re: TokenNameFinder Rodrigo Agerri
- Document Classification with imbala... viraf.bankwa...@yahoo.com.INVALID
- Re: Document Classification wit... Dan Russ
- Re: Document Classification... viraf.bankwa...@yahoo.com.INVALID
- Re: Document Classifica... Dan Russ
- Re: Document Class... Tommaso Teofili