> Basically I have been saving my spam for sometime based on > your statement and some others. I have tons of good mail and > wanted to give it corss sections to train on. What I could > do is copy a few messages from each folder of the good mail > and then train on it. I have a lot of log reports and status > emails so training is very important and I must say none of > those messages get caught. I also get very few suspects these days. > > If it is working well must I retrain?
No. The golden rule is that if it's working then you should leave it as is. I'm just warning that you'll see some really strange scores with such a huge imbalance. All testing has shown that training regimes like 'train on error' (i.e. false positives, false negatives and unsures) do much better than bulk training, and that generally smaller databases are more accurate than large ones. > Does Spambayes learn from experience or only from training? If you mean does it do any automatic training, then no it does not. =Tony.Meyer -- Please always include the list (spambayes at python.org) in your replies (reply-all), and please don't send me personal mail about SpamBayes. http://www.massey.ac.nz/~tameyer/writing/reply_all.html explains this. _______________________________________________ [email protected] http://mail.python.org/mailman/listinfo/spambayes Check the FAQ before asking: http://spambayes.sf.net/faq.html
