http://bugzilla.spamassassin.org/show_bug.cgi?id=2853
------- Additional Comments From [EMAIL PROTECTED] 2004-05-14 09:56 ------- Subject: Re: Rewrite masses/ (in perl) -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 > I'm almost done on this. I'd like to see it in 3.0.0. I need to do a > little more testing and clean up the documenation, but my (quick) results > suggest an average TCR of 64.622 (scoreset 2) which is better than the old > stuff which had a TCR of 56.288. (I did make a bit of a change to the > algorithm to score ranges to use soratio instead of rank, so that probably > accounts for much of the difference.) Cool! one thing though -- I found that relying entirely on soratio could be bad news when the hit-rates are too small; e.g. a test that hits 0.05% of spam and 0% ham may get a score of 4.0. This could be tricky if the training corpus isn't representative of other corpora, which might have some ham hits. Have you tested using 10FCV? that takes this into account. - --j. -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.2.4 (GNU/Linux) Comment: Exmh CVS iD8DBQFApPoEQTcbUG5Y7woRAgpRAKCIbYO0+X61RuQY4bf1Ff5OFn6xRwCfTgdo 3OIwbtxaofwfJhRnC/9Ot2o= =vQ0i -----END PGP SIGNATURE----- ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
