http://bugzilla.spamassassin.org/show_bug.cgi?id=3049
------- Additional Comments From [EMAIL PROTECTED] 2004-02-24 13:57 ------- Subject: Re: Implement sa-learn --loaddb dumpfile functionality > >Here is another question about the bayes numbers. Assuming no funny > >business with merging or something like that, a particular tokens > >spam/ham count must be <= the number of spam/ham messages learned > >right? > > > >So, if you've only learned 100000 spams, but a token has a spam count > >of 1+ billion, there is probably something wrong, right? > > yep. Would suggest skipping such a token during reload in that case. Why do we do this? According to Paul Graham's site, we'd get better results if the number of occurences in the DB was the total number of occurences not the number of messages it which the token was seen. ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
