Hi,

I think the only public corpus I'm aware of is the spamassassin one..

http://spamassassin.apache.org/publiccorpus/

Maybe this can give you a starting point...


Bye,
Norman



2009/6/19 LUQ <[email protected]>

>
> please, can somebody help me?
> you know to train bayesian algorithm, it takes a lot of time. So i need
> somebody answer me as soon as possible.
>
> LUQ wrote:
> >
> > Hi all
> > I want to train the Bayesian algorithm in James server.
> > I can't find the appropriate training dataset to train the Bayesian
> > filter.
> > Can somebody help me and tell me where I can get the training dataset and
> > the test dataset.
> > The second thing is, to train the Bayesian algorithm filter in James
> > server, should I train it manually, or there is a code for sending the
> > e-mails automatically.
> > The third thing I want to ask about is can I stop blacklist filter and
> use
> > the Bayesian filter and vice versa.
> > Thank you very much
> >
> >
>
> --
> View this message in context:
> http://www.nabble.com/some-questions-tp24087773p24112540.html
> Sent from the James - Users mailing list archive at Nabble.com.
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [email protected]
> For additional commands, e-mail: [email protected]
>
>

Reply via email to