I am wondering... I have a bunch of ham mail in mbox format. Should I use dspam_train and run through, say, 2500 ham messages? Or should I balance that out with spam as well? Or should I just let the training happen on new incoming messages?
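As a sanity check, the percentages in the stats below can be recomputed from the four raw counts (TP, TN, FP, FN). This is a minimal sketch in Python, assuming the usual confusion-matrix definitions. The function name and dictionary keys are my own for illustration, not anything from DSPAM itself.

```python
def dspam_rates(tp, tn, fp, fn):
    """Recompute summary rates (in percent) from raw confusion-matrix counts.

    Assumed definitions (not taken from DSPAM's source):
      spam hit rate    = TP / (TP + FN)   share of spam that was caught
      ham strike rate  = FP / (FP + TN)   share of ham wrongly flagged
      PPV              = TP / (TP + FP)   share of flagged mail that was spam
      overall accuracy = (TP + TN) / all messages
    """
    total = tp + tn + fp + fn
    return {
        "spam_hit_rate": 100.0 * tp / (tp + fn),
        "ham_strike_rate": 100.0 * fp / (fp + tn),
        "ppv": 100.0 * tp / (tp + fp),
        "overall_accuracy": 100.0 * (tp + tn) / total,
    }


# Counts copied from the stats in this message.
rates = dspam_rates(tp=1013, tn=643, fp=43, fn=125)
for name, value in rates.items():
    print(f"{name}: {value:.2f}%")
```

With the counts from this message, the results agree with the reported figures to two decimal places.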

Stats so far:

TP  True Positives:               1013
TN  True Negatives:                643
FP  False Positives:                43
FN  False Negatives:               125
SC  Spam Corpusfed:                  0
NC  Nonspam Corpusfed:               0
TL  Training Left:                1814
SHR Spam Hit Rate:              89.02%
HSR Ham Strike Rate:             6.27%
PPV Positive predictive value:  95.93%
OCA Overall Accuracy:           90.79%

------------------------------------------------------------------------------
_______________________________________________
Dspam-user mailing list
Dspam-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspam-user