Re: A different approach to scoring spamassassin hits

Loren Wilton Sat, 30 Jun 2007 20:55:50 -0700

Unfortunately I'm not on the SpamAssassin Bayes modules -- I wrote my ownBayes Engine because I wanted to do that and then thought about includingthe Rules results from SpamAssassin. I don't know where this might begoing, but it seems to be working extremely well for me based on atraining set of just a couple hundred emails in total.

Don't see this as a problem. Someone, I forget who, has a Bayes chained toan SA setup, I think the Bayes comes first, but I don't recall. He wasclaiming good results from chained classifiers using slightly different dataand methods. This seems like a reasonably possible contention to me.

If you have a pre-existing Bayes mail filter, and it runs as a filter in apipe or the like, then basically what you want to do seems very simple tome, at least conceptually. Just run the mail through SA first and then intoyour classifier. The rule names hit along with their scores will be in theheader of the mail you process in your classifier, and thus, as long as youdon't ignore header data, the rule names are there to process. No need evento modify SA. In fact you can get a header with just the rule names hitwithout the scores, so you don't have the score values being scored astokens.

The only case where you would have to modify SA in I think either Check orPMS is if you really did want to bloat every mail with the names of all ofthe rules in the SA database, rather than just those pertanent to the mailat hand.

I hink the trick is simply looking at your mail chain and figuring out howto insert a call to SA before the call to your own Bayes module.


       Loren

Re: A different approach to scoring spamassassin hits

Reply via email to