Eric "Shubes" wrote:
> Ok, so here's my situation.
> 
> I have two apparently identical low-volume servers. One shows the corruption
> symptoms, the other doesn't (yet).
> 
> Here is the local.cf (same on both):
> ok_locales all
> skip_rbl_checks 0
> required_hits 5
> report_safe 0
> rewrite_header Subject [SPAM]
> use_pyzor 1
> use_auto_whitelist 1
> bayes_path /home/vpopmail/.spamassassin/bayes
> use_bayes 1
> use_bayes_rules 1
> bayes_auto_learn 1
> bayes_auto_learn_threshold_spam 7.0
> bayes_auto_learn_threshold_nonspam 0.1
> bayes_auto_expire 1
> 
> The problem seems to manifest itself when tokens begin to become due to 
> expire.
> 
> # spamassassin -D bayes --lint
> shows that the problem server has never sync'd, while the one which hasn't
> failed (yet) shows a sync'd value.
> 
> Does anyone know when, under normal operation, the bayes db is supposed to
> be sync'd? I would think that after an autolearn=ham or autolearn=spam that
> this would happen. On one server it appears to do so (and things are fine),
> but on one where it doesn't (for a long period of time), the expiration
> process appears to exhibit problems.
> 
> Some of this observation may be coincidental, but I'm thinking that not
> syncing might be the root cause of this problem.
> 
> I've gotta go for now, but will try to follow up on this thread as soon as I
> can.

Ok, I found out what's happening from the SA list:

> The child is trying to run a Bayes expire, apparently on a large Bayes 
> database
> that hasn't had a successful expiry run in some time.  This attempt to
process
> the Bayes database is probably taking over 300 seconds, and the child is
being
> timed out and killed by something.  As a result of being killed, it never
> finished the Bayes expire processing.  So the next child tries to do the same
> thing, gets timed out and killed, the nex child tries to do the same thing...
> 
> Run a manual Bayes expire run and it will probably clean up your problems.
> If this sort of problem starts to reoccur you might consider turning off bayes
> auto expire and setting up a cron run to do it once a day or so.
> (Or more often, depending on your mail volume.)

Note (to myself), I changed timeoutsmtpd from 1200 (default 20 minutes) to
60 (1 minute) to keep sluggish connections from backing up. Perhaps that
wasn't such a good idea. I think this problem happened before I did that
though, but I can't be positive.

I think that the toaster should have a permanent fix for this. I'm thinking
that auto expire should be turned off, and a daily cron job created to
handle expirations.

The only other way to fix it I can think of would be to increase the timeout
window with auto expire on, but I don't like that idea. 20 minutes seems too
long of a period of inactivity for an smtp connection. There's also a risk
that the sending server will terminate the connection before the expiration
process completes.

Suggestions?

Gotta go away for a while - will check back when I can.
-- 
-Eric 'shubes'

---------------------------------------------------------------------
     QmailToaster hosted by: VR Hosted <http://www.vr.org>
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to