http://issues.apache.org/SpamAssassin/show_bug.cgi?id=5270
------- Additional Comments From [EMAIL PROTECTED]  2007-02-15 02:54 -------

yay for the GA! I resurrected craig-evolve.c and ran it -- here's the test
results from its run:

# SUMMARY for threshold 5.0:
# Correctly non-spam:  67498  99.92%
# Correctly spam:     115160  96.71%
# False positives:        52   0.08%
# False negatives:      3923   3.29%
# TCR(l=50): 18.255864  SpamRecall: 96.706%  SpamPrec: 99.955%

compare with the best results for the perceptron on the same data set, after
a *lot* of futzing with settings:

# SUMMARY for threshold 5.0:
# Correctly non-spam:  67468  99.88%
# Correctly spam:     112527  94.49%
# False positives:        82   0.12%
# False negatives:      6556   5.51%
# TCR(l=50): 11.175206  SpamRecall: 94.495%  SpamPrec: 99.927%

on the other hand, it took 8 hours to produce the GA results. ;) but still,
a hell of a lot better... and it didn't require any tweaking or manual
knob-twiddling, it's just fire and forget.

I'll use the GA for the other scoreset (0), and maybe try it again on sets 1
and 3 to see if it can beat the perceptron FP%/FN% rates for those too.

------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.
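
For anyone wanting to sanity-check the two summaries above, here is a rough sketch that recomputes TCR, SpamRecall and SpamPrec from the raw counts. It assumes TCR is defined as nspam / (lambda * FP + FN) with lambda = 50, which does reproduce both reported values; the function and parameter names are made up for illustration and are not taken from the SpamAssassin scripts.

```python
def summarize(spam_ok, fp, fn, lam=50):
    """Recompute the summary metrics from raw counts.

    Assumed definitions (not taken from the SpamAssassin source):
      TCR       = nspam / (lam * FP + FN)
      recall    = correctly-spam / nspam
      precision = correctly-spam / (correctly-spam + FP)
    """
    nspam = spam_ok + fn                 # total spam messages in the corpus
    tcr = nspam / (lam * fp + fn)        # each FP costs lam times one FN
    recall = spam_ok / nspam
    precision = spam_ok / (spam_ok + fp)
    return tcr, recall, precision


# Counts copied from the two summaries in the comment above.
ga = summarize(spam_ok=115160, fp=52, fn=3923)
perceptron = summarize(spam_ok=112527, fp=82, fn=6556)

for name, (tcr, recall, precision) in (("GA", ga), ("perceptron", perceptron)):
    print(f"{name}: TCR(l=50)={tcr:.6f} "
          f"SpamRecall={recall * 100:.3f}% SpamPrec={precision * 100:.3f}%")
```

With lambda = 50, each false positive weighs as much as 50 false negatives, so the GA's drop from 82 to 52 FPs accounts for a good part of the TCR gap on top of the lower FN count.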
