http://bugzilla.spamassassin.org/show_bug.cgi?id=3821





------- Additional Comments From [EMAIL PROTECTED]  2004-09-30 22:31 -------
Subject: Re:  scores are overoptimized for training set

> RH/3 is simply my rule of thumb, because I generally deal with a limited 
> corpus of only 100k emails or so. IMO, if tested via corpora with enough 
> emails for testing, RH/2 wouldn't be unreasonable. 

Sure, the perceptron does the same, but much better than humans do (which
is why I generally avoid second-guessing scores).  Henry is
experimenting with rule accuracy degradation over time, and perhaps the
perceptron can handle this even better in the future.





------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.
