http://bugzilla.spamassassin.org/show_bug.cgi?id=3049





------- Additional Comments From [EMAIL PROTECTED]  2004-02-24 14:36 -------
Subject: Re:  Implement sa-learn --loaddb dumpfile functionality 

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1


>> >So, if you've only learned 100000 spams, but a token has a spam count
>> >of 1+ billion, there is probably something wrong, right?
>> 
>> yep.  Would suggest skipping such a token during reload in that case.
>
>Why do we do this? According to Paul Graham's site, we'd get better
>results if the number of occurences in the DB was the total number of
>occurences not the number of messages it which the token was seen.

if I recall correctly the spambayes guys took a look and found that
to be just an artifact of his version of bayes.  ie. a
bit of a kludge.

- --j.
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.4 (GNU/Linux)
Comment: Exmh CVS

iD8DBQFAO9HNQTcbUG5Y7woRAmy8AJ9XzgYdJ1ltJIYt0VKnqpJ2YqXvuwCgj5ow
jcIZTFrFUO7jG0f8qms5GCo=
=6nj1
-----END PGP SIGNATURE-----





------- You are receiving this mail because: -------
You are on the CC list for the bug, or are watching someone who is.

Reply via email to