We are running

SpamAssassin version 3.2.5
  running on Perl version 5.8.8
  Solaris 9 SPARC

with the MySQL Bayes store and autolearning enabled.  Our configuration sets

bayes_expiry_max_db_size  1000000

Expiry is run manually once a day.  Here is typical output from an expiry run:

Thu Mar 19 00:12:00 EDT 2009 Forcing Bayes expiry run
[2541] dbg: bayes: using username: root
[2541] dbg: bayes: database connection established
[2541] dbg: bayes: found bayes db version 3
[2541] dbg: bayes: Using userid: 217
[2541] dbg: bayes: bayes journal sync starting
[2541] dbg: bayes: bayes journal sync completed
[2541] dbg: bayes: expiry starting
[2541] dbg: bayes: expiry check keep size, 0.75 * max: 750000
[2541] dbg: bayes: token count: 1792961, final goal reduction size: 1042961
[2541] dbg: bayes: first pass? current: 1237435937, Last: 1237349670, atime: 86400, count: 669993, newdelta: 55502, ratio: 1.55667447271837, period: 43200
[2541] dbg: bayes: can't use estimation method for expiry, unexpected result, calculating optimal atime delta (first pass)
[2541] dbg: bayes: expiry max exponent: 9
[2541] dbg: bayes: atime     token reduction
[2541] dbg: bayes: ========  ===============
[2541] dbg: bayes: 43200     1144230
[2541] dbg: bayes: 86400     732048
[2541] dbg: bayes: 172800    0
[2541] dbg: bayes: 345600    0
[2541] dbg: bayes: 691200    0
[2541] dbg: bayes: 1382400   0
[2541] dbg: bayes: 2764800   0
[2541] dbg: bayes: 5529600   0
[2541] dbg: bayes: 11059200  0
[2541] dbg: bayes: 22118400  0
[2541] dbg: bayes: first pass decided on 86400 for atime delta
[2541] dbg: bayes: expiry completed
expired old bayes database entries in 172 seconds
1060954 entries kept, 732007 deleted
token frequency: 1-occurrence tokens: 53.44%
token frequency: less than 8 occurrences: 28.22%
Thu Mar 19 00:15:09 EDT 2009 Done
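
The figures in this log can be cross-checked with a few lines of Python. This is only a sketch: the formulas and the selection rule in pick_delta are assumptions, based on the expiry description in the sa-learn documentation and chosen because they reproduce the logged values. They are not taken from the SpamAssassin source, and the variable names are made up for illustration.

```python
# Values copied from the debug log above.
MAX_DB_SIZE = 1_000_000       # bayes_expiry_max_db_size
TOKEN_COUNT = 1_792_961       # "token count"
LAST_REDUCTION = 669_993      # "count" on the "first pass?" line
LAST_ATIME_DELTA = 86_400     # "atime" on the "first pass?" line

# Assumed formulas; they happen to reproduce the logged figures.
keep_size = int(0.75 * MAX_DB_SIZE)
goal_reduction = TOKEN_COUNT - keep_size
ratio = goal_reduction / LAST_REDUCTION
new_delta = LAST_ATIME_DELTA * LAST_REDUCTION // goal_reduction

# First-pass table from the log: atime delta (seconds) -> tokens that would expire.
FIRST_PASS = {
    43200: 1144230, 86400: 732048, 172800: 0, 345600: 0, 691200: 0,
    1382400: 0, 2764800: 0, 5529600: 0, 11059200: 0, 22118400: 0,
}


def pick_delta(table, goal):
    """Assumed rule: the delta that expires the most tokens without exceeding the goal."""
    usable = {delta: n for delta, n in table.items() if 0 < n <= goal}
    return max(usable, key=usable.get) if usable else None


chosen_delta = pick_delta(FIRST_PASS, goal_reduction)

print(keep_size, goal_reduction, round(ratio, 4), new_delta, chosen_delta)
# prints: 750000 1042961 1.5567 55502 86400
```

Under these assumptions the 43200-second row is rejected because it would remove more tokens than the goal (1144230 > 1042961), which leaves 86400 as the choice, matching the "first pass decided on 86400" line.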

As I read it, this says that no tokens were last accessed more than 2 days
(172800 seconds) ago.  Is this good or bad?  Should I be increasing the max
DB size?

Thanks, Larry

Note: Here is the sa-learn --dump magic output:

0.000          0          3          0  non-token data: bayes db version
0.000          0    7894739          0  non-token data: nspam
0.000          0   10477619          0  non-token data: nham
0.000          0    1428534          0  non-token data: ntokens
0.000          0 1237349612          0  non-token data: oldest atime
0.000          0 1237479369          0  non-token data: newest atime
0.000          0          0          0  non-token data: last journal sync atime
0.000          0 1237436073          0  non-token data: last expiry atime
0.000          0      86400          0  non-token data: last expire atime delta
0.000          0     732007          0  non-token data: last expire reduction count
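
To put the two atime values from the dump in perspective, the following sketch converts them to dates and computes the span between them. It assumes the atimes are Unix timestamps in seconds, which is consistent with the dates in the expiry log; the variable names are illustrative only.

```python
from datetime import datetime, timezone

# Values copied from the --dump magic output above.
OLDEST_ATIME = 1237349612
NEWEST_ATIME = 1237479369

# Span between the oldest and newest token access times.
span_seconds = NEWEST_ATIME - OLDEST_ATIME
span_hours = span_seconds / 3600

for label, ts in (("oldest", OLDEST_ATIME), ("newest", NEWEST_ATIME)):
    # Interpret each atime as seconds since the epoch, shown in UTC.
    print(label, datetime.fromtimestamp(ts, tz=timezone.utc).isoformat())

print(f"span: {span_seconds} s = {span_hours:.1f} h")
# last line prints: span: 129757 s = 36.0 h
```

A span of about 36 hours is below the 172800-second (2-day) row of the first-pass table, which agrees with that row reporting 0 tokens to expire.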
