I train on error only, although my setting in dspam.conf is TEFT; I'll
change this for the next DSPAM setup.
I've never fed any corpus to DSPAM and I'm getting between 98 and 99
percent success rate, if not higher.
I use DSPAM on a per-user basis, with no global user. I use SpamAssassin
(Bayes) as a global spam filter.
I train on individual messages by looping through every user's '.spam'
folder, to which users move unmarked spam and to which the IMAP client
moves marked spam. I do not train on spam that DSPAM has already marked
as spam; I skip and delete those messages.
I have never trained ham.
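
A rough sketch of that loop, not my exact script. It assumes Maildir-style
folders ('new' and 'cur' subdirectories), that DSPAM tags messages with an
X-DSPAM-Result header, and that retraining is done with the dspam
command-line flags --class=spam --source=error. The function names are made
up for illustration, and deleting a message after training it is also an
assumption.

```python
import email
import subprocess
from pathlib import Path


def is_dspam_marked_spam(raw):
    """Return True if DSPAM already tagged this message as spam."""
    msg = email.message_from_bytes(raw)
    # Assumption: DSPAM adds "X-DSPAM-Result: Spam" to messages it catches.
    return (msg.get("X-DSPAM-Result") or "").strip().lower() == "spam"


def dspam_retrain_command(user):
    """Build the command that reports a missed spam for one user."""
    return ["dspam", "--user", user, "--class=spam", "--source=error"]


def train_spam_folder(user, folder, run=subprocess.run):
    """Train on unmarked spam in a Maildir-style folder; return the count.

    Messages already marked by DSPAM are skipped. Every message is deleted
    afterwards (deleting trained messages is an assumption of this sketch).
    """
    trained = 0
    for sub in ("new", "cur"):
        subdir = Path(folder) / sub
        if not subdir.is_dir():
            continue
        for path in sorted(subdir.iterdir()):
            raw = path.read_bytes()
            if not is_dspam_marked_spam(raw):
                # The message is passed to dspam on standard input.
                run(dspam_retrain_command(user), input=raw, check=True)
                trained += 1
            path.unlink()
    return trained
```

It would be called once per user, for example
train_spam_folder("alice", "/home/alice/Maildir/.spam"), where both the user
name and the path are placeholders.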

The above scenario has brought success at two sites; however, it was set
up while I was relatively inexperienced with DSPAM.
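
For reading the dspam_stats output quoted below, here is a small sketch of
how the percentages relate to the four counters. The formulas are the usual
definitions of these rates, not taken from the DSPAM source, but they do
reproduce the quoted figures. The function name is made up for illustration.

```python
def dspam_rates(tp, tn, fp, fn):
    """Compute SHR, HSR, PPV and OCA (in percent) from the four counters."""
    def pct(num, den):
        # Avoid division by zero when a counter group is empty.
        return round(100.0 * num / den, 2) if den else 0.0

    return {
        "SHR": pct(tp, tp + fn),                 # spam caught / all spam
        "HSR": pct(fp, fp + tn),                 # ham flagged / all ham
        "PPV": pct(tp, tp + fp),                 # true spam / all flagged
        "OCA": pct(tp + tn, tp + tn + fp + fn),  # correct / all messages
    }


# Counters from the quoted output.
rates = dspam_rates(tp=0, tn=1315, fp=2443, fn=2154)
print(rates)
# {'SHR': 0.0, 'HSR': 65.01, 'PPV': 0.0, 'OCA': 22.24}
```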

Eric


 
On 7/24/2015 11:17 AM, waterdog wrote:
> Okay, I apologize for all the following questions, but the more I
> troubleshoot dspam without progress, the more questions I have.
>
> Are there recommendations/documentation on how to properly train?  It seems
> that some users do corpus training and other users just train based on
> actual messages.
>
> What are the pros/cons of using a corpus vs. actual messages?
>
> Does it help to retrain multiple times using the same corpus and/or
> messages?
>
> What are the specific stats that one should look to achieve to determine if
> dspam has had enough training?
>
> Does TL need to be at zero before dspam will work at all?
>
> Do you have to train separately for each user or can all users share the
> same training?  
>
> I've tried training and retraining multiple times using corpuses and actual
> messages but don't seem to be making any real progress.  Here are my current
> stats after training with a corpus:
>
> sudo dspam_train <username> spam_2 easy_ham_2
>
> sudo dspam_stats -H <username>
>
>                 TP True Positives:                     0
>                 TN True Negatives:                  1315
>                 FP False Positives:                 2443
>                 FN False Negatives:                 2154
>                 SC Spam Corpusfed:                     0
>                 NC Nonspam Corpusfed:                  0
>                 TL Training Left:                      0
>                 SHR Spam Hit Rate                  0.00%
>                 HSR Ham Strike Rate:              65.01%
>                 PPV Positive predictive value:     0.00%
>                 OCA Overall Accuracy:             22.24%
>
> As you can see, the OCA is still low but better than it was before.
>
> It might help if someone could post working configurations for postfix,
> dspam, dovecot, and clamAV for comparison.  I've tried to follow the online
> documentation but apparently I'm missing something.
>
>
>
>
> --
> View this message in context: 
> http://dspam-users.2290790.n4.nabble.com/dspam-not-training-tp4641961p4641980.html
> Sent from the dspam users mailing list archive at Nabble.com.
>
> ------------------------------------------------------------------------------
> _______________________________________________
> Dspam-user mailing list
> Dspam-user@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/dspam-user



