On Thu, Dec 22, 2011 at 12:04 PM, Lance Norskog <[email protected]> wrote:

> The Bayes in the examples doesn't work very well in the 20 newsgroups
> example. Something is wrong  in the data ETL, the tuning options, or
> the Bayes implementation.
>
> On Wed, Dec 21, 2011 at 10:18 PM, Ted Dunning <[email protected]>
> wrote:
> > 97% is not correct.  This sounds like you ran it on the training data.
>

@Ted , yes i ran it on the same training data.


> >
> > 63% also sounds low.  I don't know what happened there.
>

Is any one tested same 20newsgrop with SGD and got better results ?

> >
> > On Wed, Dec 21, 2011 at 9:26 PM, Sreejith S <[email protected]>
> wrote:
> >
> >> Hi all,
> >>
> >> I made a comparison between SGD and Bayes classifiers over 20news-bydate
> >> dataset.
> >> http://people.csail.mit.edu/jrennie/20Newsgroups/20news-bydate.tar.gz
> >>
> >> The classifier results and confusion matrix seems a bit confused, since
> it
> >> is said that SGD is better for small datasets and Bayes for large
> datasets.
> >> Pls check my test scenario http://pastebin.com/K0cy0ayk
> >>
> >> It seems that even in small dataset like 20news-bydate Bayes gives 97 %
> >> accuracy and SGD gives 63 % :(
> >> Am i missing something?? Pls clarify.
> >>
> >> Thank You,
> >> --
> >>
> >>
> >> *Sreejith.S*
> >> http://srijiths.wordpress.com/
> >> * *http://sreejiths.emurse.com/
> >>
> >> tweet2sree@twitter <http://tweet2Sree>
> >>
>
>
>
> --
> Lance Norskog
> [email protected]
>



-- 


*Sreejith.S*
http://srijiths.wordpress.com/
* *http://sreejiths.emurse.com/

tweet2sree@twitter <http://tweet2Sree>

Reply via email to