I just pushed a bug fix for n-grams, so update your copy. Here is the result with ngram = 2:

Correctly Classified Instances    :   1995   99.75%
Incorrectly Classified Instances  :      5    0.25%
Total Classified Instances        :   2000
=======================================================
Confusion Matrix
-------------------------------------------------------
a       b       <--Classified as
995     5       |  1000   a = pos
0       1000    |  1000   b = neg
Default Category: unknown: 2

With some pruning, you will have a decent enough classifier for sentiments.
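
If you want to sanity-check the numbers above, here is a minimal Python sketch that recomputes accuracy and per-class precision and recall from the confusion matrix. It assumes rows are the actual classes and columns are the predicted ones (which matches the row totals of 1000 each). The function names are only for illustration and are not part of the classifier code.

```python
# Confusion matrix copied from the output above.
# Assumption: rows = actual class, columns = predicted class.
labels = ["pos", "neg"]
matrix = [
    [995, 5],    # actual pos: 995 classified as pos, 5 as neg
    [0, 1000],   # actual neg: 0 classified as pos, 1000 as neg
]


def accuracy(m):
    """Fraction of instances on the diagonal (correctly classified)."""
    correct = sum(m[i][i] for i in range(len(m)))
    total = sum(sum(row) for row in m)
    return correct / total


def precision(m, k):
    """Of everything predicted as class k, the fraction that really is k."""
    predicted = sum(row[k] for row in m)
    return m[k][k] / predicted if predicted else 0.0


def recall(m, k):
    """Of everything that really is class k, the fraction predicted as k."""
    actual = sum(m[k])
    return m[k][k] / actual if actual else 0.0


print(f"accuracy: {accuracy(matrix):.4f}")
for k, name in enumerate(labels):
    print(f"{name}: precision={precision(matrix, k):.4f} "
          f"recall={recall(matrix, k):.4f}")
```

The accuracy comes out to 1995 / 2000, which matches the 99.75% reported above. The only errors are the 5 pos instances classified as neg, so those show up in the recall for pos and the precision for neg.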
