I am wondering... I have a bunch of ham mail that is in mbox format.

Should I use dspam_train and run through say 2500 ham messages? Or shoudl
I balance that out with spam as well? Or just let the training happen on
incoming new messages?

Stats so far:

 TP True Positives:                  1013
                TN True Negatives:                   643
                FP False Positives:                   43
                FN False Negatives:                  125
                SC Spam Corpusfed:                     0
                NC Nonspam Corpusfed:                  0
                TL Training Left:                   1814
                SHR Spam Hit Rate                 89.02%
                HSR Ham Strike Rate:               6.27%
                PPV Positive predictive value:    95.93%
                OCA Overall Accuracy:             90.79%


------------------------------------------------------------------------------
_______________________________________________
Dspam-user mailing list
Dspam-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspam-user

Reply via email to