On Wed, 24 Nov 2004, Gustafson, Tim wrote:

> How do you keep your ntokens so low?
>
> Mine averages ((nspam + nham) * 10).  Yours is basically (nspam + nham).
> Do you run some job that expires tokens or something?  I'm running
> sa-learn --force-expire once a day (and it takes about 2-3 minutes to
> run) but the ntokens never seems to go down.  :\
>
> Tim

Tim, that's because your Bayes database is FUBAR: you've got a "future"
message in there that's fouling up your expire. Run the expire with
'-D' (debug) and look at the output; I'll bet that it doesn't actually
expire anything.

Revisiting your '--dump magic' output:

[EMAIL PROTECTED] sa-learn --dump magic
0.000          0          2          0  non-token data: bayes db version
0.000          0      88033          0  non-token data: nspam
0.000          0      15592          0  non-token data: nham
0.000          0    1729756          0  non-token data: ntokens
0.000          0 1010964573          0  non-token data: oldest atime
0.000          0 1762110386          0  non-token data: newest atime
0.000          0 1101309901          0  non-token data: last journalsync atime
0.000          0 1101301792          0  non-token data: last expiry atime
0.000          0          0          0  non-token data: last expire atime delta
0.000          0          0          0  non-token data: last expire reduction count

Note that 'newest atime' value: it's 21 years in the future. That
is "poisoning" your expire, so it's not doing anything.

  perl -e 'print scalar localtime(1762110386),"\n";'
  Sun Nov  2 13:06:26 2025
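
If you'd rather not eyeball the epoch values, here is a minimal sketch of
the same check as a shell function. It assumes the '--dump magic' column
layout shown above (value in the third field); the function name
check_future_atime and its optional "now" argument are made up for
illustration and are not part of sa-learn.

```shell
# Flag any oldest/newest atime in `sa-learn --dump magic` output that lies
# in the future. Reads the dump on stdin; exits 1 if a future atime is found.
# Usage: sa-learn --dump magic | check_future_atime [now_epoch]
check_future_atime() {
    # Default "now" to the current epoch time unless one is passed in.
    now="${1:-$(date +%s)}"
    awk -v now="$now" '
        # Field 3 is the value; field 7 is "oldest" or "newest".
        /non-token data: (oldest|newest) atime/ {
            if ($3 > now) {
                printf "FUTURE %s atime: %d (%d seconds ahead)\n", $7, $3, $3 - now
                bad = 1
            }
        }
        END { exit bad + 0 }
    '
}
```

Piping the dump into it with no argument compares against the current
clock; passing an epoch value as the first argument is only there so the
check can be replayed against an old dump.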

The 'ntokens' should be a more-or-less fixed value, based upon the setting
of your "bayes_expiry_max_db_size". The values of nspam & nham should
continually increase but ntokens should hit an upper bound and go no
higher.
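
A rough way to see whether ntokens is staying under that bound is to pull
it out of the dump and compare it with your limit. This is only a sketch:
the function name ntokens_vs_limit is made up, and the limit is whatever
you pass in, so use the value of bayes_expiry_max_db_size from your own
config rather than the example number in the comment.

```shell
# Compare ntokens from `sa-learn --dump magic` (on stdin) against a limit.
# Usage: sa-learn --dump magic | ntokens_vs_limit 150000
ntokens_vs_limit() {
    awk -v limit="$1" '
        # Field 3 of the ntokens line holds the token count.
        /non-token data: ntokens/ {
            found = 1
            state = ($3 > limit) ? "over" : "within"
            printf "ntokens=%d limit=%d (%s)\n", $3, limit, state
        }
        # Exit 2 if the dump had no ntokens line at all.
        END { if (!found) { print "no ntokens line found"; exit 2 } }
    '
}
```

A database that reports "over" day after day despite regular expire runs
is a sign that the expire is not actually removing anything.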

On a busy system that's been running for a while, nspam & nham can
easily outstrip ntokens. Here are my stats:

server15$ sa-learn --dump magic
0.000          0          2          0  non-token data: bayes db version
0.000          0    1275494          0  non-token data: nspam
0.000          0     525068          0  non-token data: nham
0.000          0     227192          0  non-token data: ntokens
0.000          0 1101252542          0  non-token data: oldest atime
0.000          0 1101360564          0  non-token data: newest atime
0.000          0 1101360564          0  non-token data: last journal sync atime
0.000          0 1101338998          0  non-token data: last expiry atime
0.000          0      86400          0  non-token data: last expire atime delta
0.000          0      73470          0  non-token data: last expire reduction count



-- 
Dave Funk                                  University of Iowa
<dbfunk (at) engineering.uiowa.edu>        College of Engineering
319/335-5751   FAX: 319/384-0549           1256 Seamans Center
Sys_admin/Postmaster/cell_admin            Iowa City, IA 52242-1527
#include <std_disclaimer.h>
Better is not better, 'standard' is better. B{
