Gustafson, Tim wrote:

0.000          0          2          0  non-token data: bayes db version
0.000          0      88033          0  non-token data: nspam
0.000          0      15592          0  non-token data: nham
0.000          0    1729756          0  non-token data: ntokens
0.000          0 1010964573          0  non-token data: oldest atime
0.000          0 1762110386          0  non-token data: newest atime
0.000          0 1101309901          0  non-token data: last journal
sync atime
0.000          0 1101301792          0  non-token data: last expiry
atime
0.000          0          0          0  non-token data: last expire
atime delta
0.000          0          0          0  non-token data: last expire
reduction count

I agree with Jim that having your SPAM/HAM numbers match doesn't really
matter, as long as you have sufficient amounts of each.  I think the
"threshold" where my users started to expierence the best filtering
accuracy was when I topped 1000 SPAMs and HAMs.  But, as Jim said
before, your mileage may vary.

Since we're all playing show-and-tell, here is a dump of the magic on my company's mail server.


0.000 0 3 0 non-token data: bayes db version
0.000 0 101024 0 non-token data: nspam
0.000 0 164343 0 non-token data: nham
0.000 0 240026 0 non-token data: ntokens
0.000 0 1101226944 0 non-token data: oldest atime
0.000 0 1101313137 0 non-token data: newest atime
0.000 0 1101313136 0 non-token data: last journal sync atime
0.000 0 1101270336 0 non-token data: last expiry atime
0.000 0 43200 0 non-token data: last expire atime delta
0.000 0 196502 0 non-token data: last expire reduction count


My auto-learn thresholds are set as follows in the global local.cf.

        bayes_auto_learn_threshold_nonspam 0.8
        bayes_auto_learn_threshold_spam 10.0

It is very important that you keep your bayes_min_[ham|spam]_num settings to at least 1000.

--
Matt Barton
Webexcellence
PH: 317.423.3548 x22
TF: 800.808.6332 x22
FX: 317.423.8735
[EMAIL PROTECTED]
www.webexc.com

Reply via email to