AlexB wrote:
Chris

From the README its not quite clear: will this work in "autolearn" ?
If you mean that the plugin can automatically learn with the autolearn setting, answer is no.

would it be enough to create the model.* files or is it a must to feed it?
You create one model file once by feeding it a large corpus of ham+spam. Once you did that, and evaluated it as described in the README, the model should be working accurately enough for your mail gateway and I expect it to work for a long time, mainly because it isn't depending that much on the type of spam (i.e. the results that the model produces are assumed to be more generalizable than for example your bayes db)


I cases of busy gateways, where manual training is higly unpractical, it would need to feed itself with headers from SA report's score >X
The problem is that feeding does not work with an SVM algorithm. You have to train on the _whole_ set _always_, so feeding mails is unpractical.

That's why you do this process _once_ with a lot of ham and spam. You can repeat this process any time but it isn't necessary to do this permanently.

It is to be expected that the model accuracy will decrease with time ( a) because your rules change and b) because spam changes ) but I think this is a slow process.



It has yet to be evaluated how well the model performs over time :)



Best regards,


Chris

Attachment: smime.p7s
Description: S/MIME Cryptographic Signature

Reply via email to