I train on error only, though, my settings are TEFT in dspam.conf--I'll change this for the next DSPAM setup. I've never fed any corpus to DSPAM and I'm getting between 98 and 99 percent success rate, if not higher. I use DSPAM on a per user bases--no global user. I use spamassassin (bayes) as a global spam filter. I train individual messages by looping through each in every user's '.spam' folder to which they move unmarked spam and to which the imap client moves marked spam. I do not train spam marked by DSPAM as spam, I skip and delete them. I never have trained ham.
The above scenario has brought success at 2 sites, however, it was done while I was relatively inexperienced with DSPAM. Eric On 7/24/2015 11:17 AM, waterdog wrote: > Okay, I apologize for all the following questions but, the more I > troubleshoot dspam without progress, the more questions I have. > > Are there recommendations/documentation on how to properly train? It seems > that some users do corpus training and other users just train based on > actual messages. > > What are the pros/cons of using a corpus vs. actual messages? > > Does it help to retrain multiple times using the same corpus and/or > messages? > > What are the specific stats that one should look to achieve to determine if > dspam has had enough training? > > Does TL need to be at zero before dspam will work at all? > > Do you have to train separately for each user or can all users share the > same training? > > I've tried training and retraining multiple times using corpuses and actual > messages but don't seem to be making any real progress. Here are my current > stats after training with a corpus: > > sudo dspam_train <username> spam_2 easy_ham_2 > > sudo dspam_stats -H <username> > > TP True Positives: 0 > TN True Negatives: 1315 > FP False Positives: 2443 > FN False Negatives: 2154 > SC Spam Corpusfed: 0 > NC Nonspam Corpusfed: 0 > TL Training Left: 0 > SHR Spam Hit Rate 0.00% > HSR Ham Strike Rate: 65.01% > PPV Positive predictive value: 0.00% > OCA Overall Accuracy: 22.24% > > As you can see, the OCA is still low but better than it was before. > > It might help if someone could post working configurations for postfix, > dspam, dovecot, and clamAV for comparison. I've tried to follow the online > documentation but apparently I'm missing something. > > > > > -- > View this message in context: > http://dspam-users.2290790.n4.nabble.com/dspam-not-training-tp4641961p4641980.html > Sent from the dspam users mailing list archive at Nabble.com. > > ------------------------------------------------------------------------------ > _______________________________________________ > Dspam-user mailing list > Dspam-user@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/dspam-user ------------------------------------------------------------------------------ _______________________________________________ Dspam-user mailing list Dspam-user@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspam-user