http://issues.apache.org/SpamAssassin/show_bug.cgi?id=5270
[EMAIL PROTECTED] changed:
What |Removed |Added
----------------------------------------------------------------------------
Status|REOPENED |ASSIGNED
------- Additional Comments From [EMAIL PROTECTED] 2007-02-14 09:14 -------
best results I've gotten out of the perceptron so far for set 2 have been:
# SUMMARY for threshold 5.0:
# Correctly non-spam: 67468 99.88%
# Correctly spam: 112527 94.49%
# False positives: 82 0.12%
# False negatives: 6556 5.51%
# TCR(l=50): 11.175206 SpamRecall: 94.495% SpamPrec: 99.927%
with intermittent cases where it just goes nuts and gives 30% FNs (with
most of the scores zeroed). I can't particularly figure out what needs
to be done with the parameters to work around it, nor am I keen to sit
here trying sets of params in a trial-and-error fashion...
I'm going to go back to using the GA, see if I can get better results out
of that. if anyone wants a try to fix the perceptron problems for set0
and set2, the full logs are on the zone at:
-rw-rw-r-- 2 jm other 1446756927 Feb 7 22:20
/export/home/jm/ftp/spamassassin/masses/spam-full.log
-rw-rw-r-- 2 jm other 413512085 Feb 7 22:15
/export/home/jm/ftp/spamassassin/masses/ham-full.log
Theo -- I'm pretty sure it's not a GIGO logs problem, since the results for set
1 and set 3 are quite good, and the freqs look good too. actually, I'll upload
the
freqs for set3, they're worth checking out.
------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.