[Bug 3049] Implement sa-learn --loaddb dumpfile functionality

bugzilla-daemon 24 Feb 2004 21:57:58 -0000

http://bugzilla.spamassassin.org/show_bug.cgi?id=3049






------- Additional Comments From [EMAIL PROTECTED]  2004-02-24 13:57 -------
Subject: Re:  Implement sa-learn --loaddb dumpfile functionality

> >Here is another question about the bayes numbers.  Assuming no funny
> >business with merging or something like that, a particular tokens
> >spam/ham count must be <= the number of spam/ham messages learned
> >right?
> >
> >So, if you've only learned 100000 spams, but a token has a spam count
> >of 1+ billion, there is probably something wrong, right?
> 
> yep.  Would suggest skipping such a token during reload in that case.

Why do we do this? According to Paul Graham's site, we'd get better
results if the number of occurences in the DB was the total number of
occurences not the number of messages it which the token was seen.





------- You are receiving this mail because: -------
You are on the CC list for the bug, or are watching someone who is.

[Bug 3049] Implement sa-learn --loaddb dumpfile functionality

Reply via email to