On 18/03/15 21:46, Reindl Harald wrote:

Am 18.03.2015 um 22:29 schrieb David B Funk:
 > Just have an internal mail-submission port that isn't routed thru SA

may not be possible if you have hundrets of domains without setup a
internal DNS view just for a different MX

In general you don't want auto-mail running thru SA for this reason and
to prevent Bayes poisinging

in general autolearning is a major mistake at all

frankly i trained over months with *hand chosen* mail smaples and spent
nearly two weeks day and night to remove bayes-posioning from the
samples and rebuild bayes from scratch leading in reduce the ntokens
from 1700000 to 1500000


Wow - you must be fun at parties...

Personally I find the bayes_autolearn_on_error option which was added to SA 3.3, works pretty well to avoid poisoning and overtraining. From the docs:

bayes_auto_learn_on_error (0 | 1) (default: 0)

With bayes_auto_learn_on_error off, autolearning will be performed even if bayes classifier already agrees with the new classification (i.e. yielded BAYES_00 for what we are now trying to teach it as ham, or yielded BAYES_99 for spam). This is a traditional setting, the default was chosen to retain backwards compatibility.

With bayes_auto_learn_on_error turned on, autolearning will be performed only when a bayes classifier had a different opinion from what the autolearner is now trying to teach it (i.e. it made an error in judgement). This strategy may or may not produce better future classifications, but usually works very well, while also preventing unnecessary overlearning and slows down database growth.

Regards,
Steve.

Reply via email to