http://issues.apache.org/SpamAssassin/show_bug.cgi?id=5270
[EMAIL PROTECTED] changed: What |Removed |Added ---------------------------------------------------------------------------- Status|REOPENED |ASSIGNED ------- Additional Comments From [EMAIL PROTECTED] 2007-02-14 09:14 ------- best results I've gotten out of the perceptron so far for set 2 have been: # SUMMARY for threshold 5.0: # Correctly non-spam: 67468 99.88% # Correctly spam: 112527 94.49% # False positives: 82 0.12% # False negatives: 6556 5.51% # TCR(l=50): 11.175206 SpamRecall: 94.495% SpamPrec: 99.927% with intermittent cases where it just goes nuts and gives 30% FNs (with most of the scores zeroed). I can't particularly figure out what needs to be done with the parameters to work around it, nor am I keen to sit here trying sets of params in a trial-and-error fashion... I'm going to go back to using the GA, see if I can get better results out of that. if anyone wants a try to fix the perceptron problems for set0 and set2, the full logs are on the zone at: -rw-rw-r-- 2 jm other 1446756927 Feb 7 22:20 /export/home/jm/ftp/spamassassin/masses/spam-full.log -rw-rw-r-- 2 jm other 413512085 Feb 7 22:15 /export/home/jm/ftp/spamassassin/masses/ham-full.log Theo -- I'm pretty sure it's not a GIGO logs problem, since the results for set 1 and set 3 are quite good, and the freqs look good too. actually, I'll upload the freqs for set3, they're worth checking out. ------- You are receiving this mail because: ------- You are the assignee for the bug, or are watching the assignee.