> RH/3 is simply my rule of thumb, because I generally deal with a limited 
> corpus of only 100k emails or so. IMO, if tested via corpora with enough 
> emails for testing, RH/2 wouldn't be unreasonable. 

Sure, the perceptron does the same, but much better than humans (which
is why I generally avoid second guessing scores).  Henry is
experimenting with rule accuracy degradation over time and perhaps the
perceptron can handle this even better in the future.

-- 
Daniel Quinlan                     ApacheCon! 13-17 November (3 SpamAssassin
http://www.pathname.com/~quinlan/  http://www.apachecon.com/  sessions & more)

Reply via email to