http://issues.apache.org/SpamAssassin/show_bug.cgi?id=5497





------- Additional Comments From [EMAIL PROTECTED]  2007-06-07 07:41 -------
No, I think the principle that the Bayes learner is relatively insensitive
to the absolute numbers of ham and spam is correct. The problem is that it is
sensitive to non-representative samples. To the degree that the tokens in
the very lowest-scoring messages are not representative of the tokens in all
ham, the learner will be less accurate.

Autolearning is a substitute for a mechanism in which all ham and all spam are
correctly learnt. The further you move from that ideal by being conservative
with the threshold, the weaker Bayes becomes.
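
To make the point concrete, here is a rough sketch with a toy corpus. It is
not SpamAssassin's actual Bayes code: the messages, the scores, the -1.0 ham
threshold and the token_spam_prob helper are all invented for illustration,
and the estimate is a simplified ratio of per-class token frequencies.

```python
# Toy corpus: (hypothetical score, tokens). Invented data, not real mail.
HAM = [
    (-2.0, {"meeting", "agenda"}),
    (-1.5, {"meeting", "report"}),
    (0.5, {"invoice", "meeting"}),
    (1.5, {"invoice", "offer"}),
    (2.5, {"offer", "newsletter"}),
]
SPAM = [
    {"offer", "pills"},
    {"offer", "winner"},
    {"invoice", "winner"},
    {"pills", "winner"},
]


def token_spam_prob(token, ham_msgs, spam_msgs):
    """Simplified estimate: compare the fraction of learned spam containing
    the token with the fraction of learned ham containing it."""
    ham_ratio = sum(token in m for m in ham_msgs) / len(ham_msgs)
    spam_ratio = sum(token in m for m in spam_msgs) / len(spam_msgs)
    if ham_ratio + spam_ratio == 0:
        return 0.5  # token never seen: no evidence either way
    return spam_ratio / (spam_ratio + ham_ratio)


# Ideal case: every ham message is learnt.
all_ham = [tokens for _, tokens in HAM]
# Conservative autolearn: only ham at or below a hypothetical -1.0 threshold.
low_ham = [tokens for score, tokens in HAM if score <= -1.0]

p_all = token_spam_prob("offer", all_ham, SPAM)
# Same ham learnt twice: absolute counts double, proportions do not change.
p_doubled = token_spam_prob("offer", all_ham * 2, SPAM)
p_low = token_spam_prob("offer", low_ham, SPAM)

print(f"all ham: {p_all:.2f}  doubled ham: {p_doubled:.2f}  lowest ham only: {p_low:.2f}")
```

In this toy data "offer" appears in two of the five ham messages, so with all
ham learnt it comes out at about 0.56, and doubling the ham count leaves that
unchanged. With only the two lowest-scoring ham messages learnt, "offer" is
never seen in ham and is rated 1.0, i.e. pure spam.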


