On Sat, Mar 04, 2006 at 09:56:14PM -0500, Gabriel Wachman wrote: > During training I run: > sa-learn --dbpath $WORKDIR --ham $DATADIR/$message_dir > (likewise for spam) > > During testing I run: > spamassassin -t -p $PREFSPATH $DATADIR/$message_dir
You may want to look into mass-check. It's much better for this type of thing. > During testing, I can see spamassassin create a "bayes_journal" file and > write to it continuously. I understand this is spamassassin's way of Yes. > storing its updates to bayes_* temporarily until the updates are merged. Yes. > My concern is that it's using bayes_journal in addition to bayes_toks > and bayes_seen during testing, but I just want it to use the bayes_toks > and bayes_seen generating during training. If you disable auto_learn, and you don't use sa-learn afterwards, there won't be any other learning. The journal updates (by default) are only atime values used for expiry. You can't disable them, though you can disable the journal sync if you want to ("bayes_journal_max_size 0"). Hope this helps. :) -- Randomly Generated Tagline: "I spaced out on lingo." - Prof. Wills
pgpQJBgdlpisd.pgp
Description: PGP signature