I was looking at my Bayes DB files and noticed that they seem very
large.  Is this a problem?

54K May 21 13:28 bayes_journal
82M May 21 13:28 bayes_seen
80M May 21 13:28 bayes_toks

I went ahead and did a sa-learn --dump magic and this is the output:

0.000          0          2          0  non-token data: bayes db version
0.000          0      70627          0  non-token data: nspam
0.000          0      29182          0  non-token data: nham
0.000          0    2041152          0  non-token data: ntokens
0.000          0  956386256          0  non-token data: oldest atime
0.000          0 2093049063          0  non-token data: newest atime
0.000          0 1085163866          0  non-token data: last journal
sync atime
0.000          0 1085163864          0  non-token data: last expiry
atime
0.000          0     172800          0  non-token data: last expire
atime delta
0.000          0      40459          0  non-token data: last expire
reduction count

Does it seem unusual to have 2 million tokens in the database?

Is this something I should worry about or am I good to go?

We are running Spamassassin 2.63 with sa-exim on a Debian box with
2.4.22 kernel which just passes all email to our Exchange servers.  We
get around 30,000 emails a day.

Thanks!

Reply via email to