http://issues.apache.org/SpamAssassin/show_bug.cgi?id=5497





------- Additional Comments From [EMAIL PROTECTED]  2007-06-06 14:31 -------
> I suspect that at least of the reason is the change in the auto-learn ham
> threshold which means that 3.2 auto-learns a lot fewer messages as ham than
> previous versions did. Which means that if you are just auto-learning, and not
> calling sa-learn to learn ham, a disproportionate number of spam tokens will 
> be
> learnt.

I can definitely say that a vast majority (+95%) of our learning is from the >
auto-learning system. It has been difficult to get our users to feed significant
amounts of both ham and spam to our sa-learn feedback mechanism. Any change in >
the auto-learn thresholds are likely effecting the accuracy of our database.

Is this something simple I can change to revert back to the pre-3.2.0 setting?

What was the previous value for bayes_auto_learn_threshold_nonspam?

> USER OPTIONS
>        The following configuration settings are used to control auto-learn-
>        ing:
> 
>        bayes_auto_learn_threshold_nonspam n.nn   (default: 0.1)
>            The score threshold below which a mail has to score, to be fed
>            into SpamAssassinâs learning systems automatically as a non-spam
>            message.
> 
>        bayes_auto_learn_threshold_spam n.nn      (default: 12.0)
>            The score threshold above which a mail has to score, to be fed
>            into SpamAssassinâs learning systems automatically as a spam mes-
>            sage.
> 
>            Note: SpamAssassin requires at least 3 points from the header, and
>            3 points from the body to auto-learn as spam.  Therefore, the min-
>            imum working value for this option is 6.




------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.

Reply via email to