http://bugzilla.spamassassin.org/show_bug.cgi?id=3049
------- Additional Comments From [EMAIL PROTECTED] 2004-02-24 14:36 ------- Subject: Re: Implement sa-learn --loaddb dumpfile functionality -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 >> >So, if you've only learned 100000 spams, but a token has a spam count >> >of 1+ billion, there is probably something wrong, right? >> >> yep. Would suggest skipping such a token during reload in that case. > >Why do we do this? According to Paul Graham's site, we'd get better >results if the number of occurences in the DB was the total number of >occurences not the number of messages it which the token was seen. if I recall correctly the spambayes guys took a look and found that to be just an artifact of his version of bayes. ie. a bit of a kludge. - --j. -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.2.4 (GNU/Linux) Comment: Exmh CVS iD8DBQFAO9HNQTcbUG5Y7woRAmy8AJ9XzgYdJ1ltJIYt0VKnqpJ2YqXvuwCgj5ow jcIZTFrFUO7jG0f8qms5GCo= =6nj1 -----END PGP SIGNATURE----- ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
