http://issues.apache.org/SpamAssassin/show_bug.cgi?id=5270

------- Additional Comments From [EMAIL PROTECTED]  2007-02-15 02:54 -------
Yay for the GA (genetic algorithm)!  I resurrected craig-evolve.c and ran it.
Here are the test results from its run:

# SUMMARY for threshold 5.0:
# Correctly non-spam:  67498  99.92%
# Correctly spam:     115160  96.71%
# False positives:        52  0.08%
# False negatives:      3923  3.29%
# TCR(l=50): 18.255864  SpamRecall: 96.706%  SpamPrec: 99.955%

Compare with the best results for the perceptron on the same data set,
after a *lot* of futzing with settings:

# SUMMARY for threshold 5.0:
# Correctly non-spam:  67468  99.88%
# Correctly spam:     112527  94.49%
# False positives:        82  0.12%
# False negatives:      6556  5.51%
# TCR(l=50): 11.175206  SpamRecall: 94.495%  SpamPrec: 99.927%
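
For anyone wanting to check the summary lines by hand, here is a minimal
sketch of how the derived figures fall out of the four raw counts. It assumes
the usual definition TCR = nspam / (lambda * FP + FN) with lambda = 50, which
does reproduce the TCR, SpamRecall and SpamPrec values in both summaries. The
function name `summarize` and its argument names are made up for illustration;
they are not taken from the SpamAssassin scripts.

```python
def summarize(tn, tp, fp, fn, lam=50.0):
    """Derive summary statistics from raw classification counts.

    tn: correctly non-spam, tp: correctly spam,
    fp: false positives,    fn: false negatives,
    lam: cost weight of one false positive relative to one false negative.
    Assumes at least one error, otherwise the TCR denominator is zero.
    """
    n_spam = tp + fn
    n_ham = tn + fp
    return {
        # Total cost ratio: higher is better.
        "tcr": n_spam / (lam * fp + fn),
        # Fraction of all spam that was caught.
        "spam_recall": tp / n_spam,
        # Fraction of mail flagged as spam that really was spam.
        "spam_prec": tp / (tp + fp),
        # Error rates as percentages of ham and spam respectively.
        "fp_pct": 100.0 * fp / n_ham,
        "fn_pct": 100.0 * fn / n_spam,
    }


# Counts copied from the two summaries in this comment.
ga = summarize(tn=67498, tp=115160, fp=52, fn=3923)
perceptron = summarize(tn=67468, tp=112527, fp=82, fn=6556)

for name, s in (("GA", ga), ("perceptron", perceptron)):
    print("%-10s TCR(l=50): %.6f  SpamRecall: %.3f%%  SpamPrec: %.3f%%  "
          "FP: %.2f%%  FN: %.2f%%"
          % (name, s["tcr"], 100 * s["spam_recall"], 100 * s["spam_prec"],
             s["fp_pct"], s["fn_pct"]))
```

Because a false positive is weighted 50 times as heavily as a false negative,
the GA's lower FP count (52 vs 82) accounts for a large share of the TCR gap.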

On the other hand, it took 8 hours to produce the GA results. ;)  But still,
it's a hell of a lot better (FN% down from 5.51% to 3.29%, FP% down from 0.12%
to 0.08%), and it didn't require any tweaking or manual knob-twiddling; it's
just fire and forget.  I'll use the GA for the other scoreset (0), and maybe
try it again on sets 1 and 3 to see if it can beat the perceptron FP%/FN%
rates for those too.


