Hi all,

as a result of the recent "2+2 != 4" discussion on the list, here is a new plugin, which tries to learn ham/spam classification only by knowing which rules triggered and which did not. This is, so to say, an automatic meta rule.

The plugin is currently experimental and can only be checked out from SVN at:

       https://svn.own-hero.net/sysadmin/MetaSVM/trunk


For now I recommend to not use it in production environment, as it is still untested (except that I tested it). In order to use the plugin, you need to train your own model, which requires a certain amount of ham/spam.

I evaluated the plugin with my own ham/spam corpus (roughly 5000 spam, 3000 ham) and the resulting model did not produce false positives with respect to the default scoring, but it catched approx. 30% of the mails that were not catched by SA itself. I'll probably release more detailed numbers in some whitepaper soon :)


Best regards,


Chris


Attachment: smime.p7s
Description: S/MIME Cryptographic Signature

Reply via email to