Justin Mason wrote:
> However: it's important for SpamAssassin developers and mass-checkers to
> get a "representative" feed of spam -- with all kinds of spam included --
> so that the rules are measured against something close to reality.  
On a related note, we actually *stopped* using front-line RBLs as with
them in place, we were no longer able to get true stats as to the actual
flow of Spam/Ham into our sites. Which meant that we really couldn't
tell how effective our antispam systems were being. The "broad axe" that
is RBL meant that a single mail message coming from servers may be
blocked dozens of times (as it retries), meaning that our stats would
over-represent the effectiveness of front-line RBL methods. Now we just
let it all hit SpamAssassin, and have simply upped the score on those
RBLs we used to trust to reject directly, so that the Spam doesn't get
any further. End result: no delivery changes - but better quality stats.
Obviously you have to have over-speced your mail servers to be able to
do this - something poor old Justin can't manage I think :-)

(FYI: picking a random user of ours and looking at all Internet email
they received in Aug 2006 showed SA had >99% success rate at tagging
Spam. 85% was quarantined (scores >10/5) and the rest tagged for the
users to filter on. Also, ZERO ham misclassification - which is
something certain commercial competitors to SpamAssassin are actually
pretty bad at...)

Now if only it could deal with this storm of "VIiiagra"/"VIragra" spam
that has been sneaking in... :-)

-- 
Cheers

Jason Haar
Information Security Manager, Trimble Navigation Ltd.
Phone: +64 3 9635 377 Fax: +64 3 9635 417
PGP Fingerprint: 7A2E 0407 C9A6 CAF6 2B9F 8422 C063 5EBB FE1D 66D1

Reply via email to