Re: Querry regarding use of classifier in Mahout

Ted Dunning Mon, 18 Oct 2010 10:20:25 -0700

Is this on the training data?  Or held-out test data?

If on test data, this is much, much too accurate to be believed.


On Mon, Oct 18, 2010 at 10:14 AM, Robin Anil <[email protected]> wrote:

> Just pushed a bug fix for ngrams. Update your copy. Here is the result with
> ngram = 2
>
> Correctly Classified Instances          :       1995     99.75%
> Incorrectly Classified Instances        :          5      0.25%
> Total Classified Instances              :       2000
>
> =======================================================
> Confusion Matrix
> -------------------------------------------------------
> a     b     <--Classified as
> 995   5     |  1000   a     = pos
> 0     1000  |  1000   b     = neg
> Default Category: unknown: 2
>
>
> With some pruning, you will have a decent enough classifier for sentiments
>

Re: Querry regarding use of classifier in Mahout

Reply via email to