http://bugzilla.spamassassin.org/show_bug.cgi?id=4505





------- Additional Comments From [EMAIL PROTECTED]  2005-08-03 19:49 -------
Subject: Re:  Score generation for SpamAssassin 3.1

FWIW, the data from scoreset 3 more closely supports using the equation 
(bayes_group-50)/(50/3.5) to calculate the score.  This is quite close to 
Justin's values above 50, but departs considerably at lower Bayes values:

Group   Set 3   Norm 3.5        Justin 2        Justin 3
0       -2.600  -3.500  -2.312  -2.599
5       -0.410  -3.150  -1.110  -1.110
20      -1.950  -2.100  -0.740  -0.740
40      -1.100  -0.700  -0.185  -0.185
50      0.000   0.000   0.001   0.001
60      0.370   0.700   1.000   1.000
80      2.090   2.100   2.000   2.000
95      2.060   3.150   3.000   3.000
99      1.890   3.430   3.500   3.500

The "Norm 3.5" group matching the above equation is very close to the 
Perceptron scores for Bayes_20 to Bayes_80.  The Perceptron score for Bayes_05 
is just plain wonky, and of course the scores flatten completely at Bayes_80.

Running a simple linear solution to approximate the bayes-20 to bayes-80 scores 
with a straight line produces a slightly lower value for the constant (3.5) 
above: 3.3875.  This of course produces slightly less aggessive scores on the 
top and bottom ends:

Group   Set 3   Norm 3.3875
0       -2.600  -3.388
5       -0.410  -3.049
20      -1.950  -2.033
40      -1.100  -0.678
50      0.000   0.000
60      0.370   0.678
80      2.090   2.033
95      2.060   3.049
99      1.890   3.320





------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.

Reply via email to