Hi, the only public corpus I'm aware of is the SpamAssassin one:
http://spamassassin.apache.org/publiccorpus/

Maybe this can give you a starting point...

Bye,
Norman

2009/6/19 LUQ <[email protected]>

>
> please, can somebody help me?
> you know to train bayesian algorithm, it takes a lot of time. So i need
> somebody answer me as soon as possible.
>
> LUQ wrote:
> >
> > Hi all
> > I want to train the Bayesian algorithm in James server.
> > I can't find the appropriate training dataset to train the Bayesian
> > filter.
> > Can somebody help me and tell me where I can get the training dataset and
> > the test dataset.
> > The second thing is, to train the Bayesian algorithm filter in James
> > server, should I train it manually, or there is a code for sending the
> > e-mails automatically.
> > The third thing I want to ask about is can I stop blacklist filter and
> > use the Bayesian filter and vice versa.
> > Thank you very much
> >
