https://issues.apache.org/SpamAssassin/show_bug.cgi?id=6942

--- Comment #14 from Mark Martinec <[email protected]> ---
> > > 12240246  non-token data: nspam
> > >  5076877  non-token data: nham
> > > used_memory_human:3.10G
> > Suggesting: bayes_auto_learn_on_error 1
> 
> why?

>From the stats it seems to me like a large number of tokens,
and 3 GB of resident storage is on a high side and probably
growing still at the same rate (until expiration kicks in).

The bayes_auto_learn_on_error can reduce the growth rate
substantially, without sacrificing much on the quality of
results. Some studies even indicated that a learn_on_error
strategy increased the classification quality (but I won't
speculate on that here).

-- 
You are receiving this mail because:
You are the assignee for the bug.

Reply via email to