I am trying document classification using OpenNLP however my data is highly unbalanced (majority class is 97%). I recognize that I could randomly over/under sample the data set, and am reading up on SMOTE and ADASYN (not sure how to apply these to OpenNLP). Any suggestions on dealing with the highly unbalanced data would be appreciated. Thanks - viraf
- TokenNameFinder Viraf Bankwalla
- Re: TokenNameFinder Rodrigo Agerri
- Re: TokenNameFinder Viraf Bankwalla
- Re: TokenNameFinder Rodrigo Agerri
- Document Classification with imbala... [email protected]
- Re: Document Classification wit... Dan Russ
- Re: Document Classification... [email protected]
- Re: Document Classifica... Dan Russ
- Re: Document Class... Tommaso Teofili
