We are running SpamAssassin version 3.2.5 running on Perl version 5.8.8 Solaris 9 Sparc with the MySQL Bayes store and autolearning. We are using bayes_expiry_max_db_size 1000000 Expiry is done manually once a day. Here is a typical output from expiry:
Thu Mar 19 00:12:00 EDT 2009 Forcing Bayes expiry run [2541] dbg: bayes: using username: root [2541] dbg: bayes: database connection established [2541] dbg: bayes: found bayes db version 3 [2541] dbg: bayes: Using userid: 217 [2541] dbg: bayes: bayes journal sync starting [2541] dbg: bayes: bayes journal sync completed [2541] dbg: bayes: expiry starting [2541] dbg: bayes: expiry check keep size, 0.75 * max: 750000 [2541] dbg: bayes: token count: 1792961, final goal reduction size: 1042961 [2541] dbg: bayes: first pass? current: 1237435937, Last: 1237349670, atime: 86400, count: 669993, newdelta: 55502, ratio: 1.55667447271837, period: 43200 [2541] dbg: bayes: can't use estimation method for expiry, unexpected result, calculating optimal atime delta (first pass) [2541] dbg: bayes: expiry max exponent: 9 [2541] dbg: bayes: atime token reduction [2541] dbg: bayes: ======== =============== [2541] dbg: bayes: 43200 1144230 [2541] dbg: bayes: 86400 732048 [2541] dbg: bayes: 172800 0 [2541] dbg: bayes: 345600 0 [2541] dbg: bayes: 691200 0 [2541] dbg: bayes: 1382400 0 [2541] dbg: bayes: 2764800 0 [2541] dbg: bayes: 5529600 0 [2541] dbg: bayes: 11059200 0 [2541] dbg: bayes: 22118400 0 [2541] dbg: bayes: first pass decided on 86400 for atime delta [2541] dbg: bayes: expiry completed expired old bayes database entries in 172 seconds 1060954 entries kept, 732007 deleted token frequency: 1-occurrence tokens: 53.44% token frequency: less than 8 occurrences: 28.22% Thu Mar 19 00:15:09 EDT 2009 Done This is telling me that there are no tokens more than 2 days old. Is this good or bad? Should I be increasing the max DB size? Thanks, Larry Note: Here is the --dump magic output: 0.000 0 3 0 non-token data: bayes db version 0.000 0 7894739 0 non-token data: nspam 0.000 0 10477619 0 non-token data: nham 0.000 0 1428534 0 non-token data: ntokens 0.000 0 1237349612 0 non-token data: oldest atime 0.000 0 1237479369 0 non-token data: newest atime 0.000 0 0 0 non-token data: last journal sync atime 0.000 0 1237436073 0 non-token data: last expiry atime 0.000 0 86400 0 non-token data: last expire atime delta 0.000 0 732007 0 non-token data: last expire reduction count