http://issues.apache.org/SpamAssassin/show_bug.cgi?id=5270


[EMAIL PROTECTED] changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|REOPENED                    |ASSIGNED




------- Additional Comments From [EMAIL PROTECTED]  2007-02-14 09:14 -------
best results I've gotten out of the perceptron so far for set 2 have been:

# SUMMARY for threshold 5.0:
# Correctly non-spam:  67468  99.88%
# Correctly spam:     112527  94.49%
# False positives:        82  0.12%
# False negatives:      6556  5.51%
# TCR(l=50): 11.175206  SpamRecall: 94.495%  SpamPrec: 99.927%

with intermittent cases where it just goes nuts and gives 30% FNs (with
most of the scores zeroed).  I can't particularly figure out what needs
to be done with the parameters to work around it, nor am I keen to sit
here trying sets of params in a trial-and-error fashion...

I'm going to go back to using the GA, see if I can get better results out
of that.  if anyone wants a try to fix the perceptron problems for set0
and set2, the full logs are on the zone at:

-rw-rw-r--   2 jm       other    1446756927 Feb  7 22:20
/export/home/jm/ftp/spamassassin/masses/spam-full.log
-rw-rw-r--   2 jm       other    413512085 Feb  7 22:15
/export/home/jm/ftp/spamassassin/masses/ham-full.log


Theo -- I'm pretty sure it's not a GIGO logs problem, since the results for set
1 and set 3 are quite good, and the freqs look good too.  actually, I'll upload 
the
freqs for set3, they're worth checking out.



------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.

Reply via email to