Alexey Loukianov wrote:
> Hello all,
> 
> I'm forced to run a bunch of SA-specialized servers to be able to
> handle processing of all the incoming mail to the corporate servers.
> All the SA hosts utilize the same HA mysql for bayest storage DB, and
> a simple king of load-balancing for SA is achieved by built in spamc
> functionality.
> 
> From time to time some of SA servers tend to 'stuck'. This shows up in
> logs like this:
> 
> # qmlog -s @40000000458765251acf0b74.s spamd | grep -E 'error|warn'
> 2006-12-19 00:53:00.508318500 [10802] warn: spamd: timeout: (300 second 
> timeout while trying to PROCESS) at /usr/bin/spamd line 1686, <GEN6356> line 
> 254.
> 2006-12-19 00:53:14.979634500 [11224] warn: spamd: timeout: (300 second 
> timeout while trying to PROCESS) at /usr/bin/spamd line 1686, <GEN6395> line 
> 337.
> 2006-12-19 00:53:20.724340500 [11225] warn: spamd: timeout: (300 second 
> timeout while trying to PROCESS) at /usr/bin/spamd line 1686, <GEN6400> line 
> 319.
> 2006-12-19 00:53:25.781288500 [11226] warn: spamd: timeout: (300 second 
> timeout while trying to PROCESS) at /usr/bin/spamd line 1686, <GEN6403> line 
> 300.
> 2006-12-19 00:53:44.309364500 [10261] warn: spamd: timeout: (300 second 
> timeout while trying to PROCESS) at /usr/bin/spamd line 1686, <GEN6428> line 
> 319.
> 2006-12-19 00:58:13.590168500 [10802] error: child processing timeout at 
> /usr/bin/spamd line 1085, <GEN6359> line 334.
> 2006-12-19 00:58:13.590507500 [10802] error: child processing timeout at 
> /usr/bin/spamd line 1085, <GEN6359> line 334.
> 2006-12-19 00:58:16.081281500 [11224] error: child processing timeout at 
> /usr/bin/spamd line 1085, <GEN6401> line 664.
> 2006-12-19 00:58:16.081622500 [11224] error: child processing timeout at 
> /usr/bin/spamd line 1085, <GEN6401> line 664.
> 2006-12-19 00:58:24.804143500 [11225] error: child processing timeout at 
> /usr/bin/spamd line 1085, <GEN6401> line 253.
> 2006-12-19 00:58:24.804156500 [11225] error: child processing timeout at 
> /usr/bin/spamd line 1085, <GEN6401> line 253.
> 2006-12-19 00:58:33.883826500 [11226] error: child processing timeout at 
> /usr/bin/spamd line 1085, <GEN6406> line 321.
> 2006-12-19 00:58:33.883837500 [11226] error: child processing timeout at 
> /usr/bin/spamd line 1085, <GEN6406> line 321.
> 2006-12-19 00:58:43.214455500 [10261] error: child processing timeout at 
> /usr/bin/spamd line 1085, <GEN6430> line 1371.
> 2006-12-19 00:58:43.214841500 [10261] error: child processing timeout at 
> /usr/bin/spamd line 1085, <GEN6430> line 1371.
> 
> From the moment the first warning shows up in the logs and until the
> spamd would be restarted by hand processing stucks, and all the spamd
> processes die after a 300 sec timeout, with a corresponding message in
> logs.
> 
> I know that this issue is related to bayest rules, as if I turn'em off
> in local.cf - no hangs happen. Some time ago E."S". mentioned in list
> that there's an issue in current SA that might cause such timeouts,
> and that it's connected with SA bayest autoexpire function. No
> problems, turned autoexpire off in local.cf, restarted spamd, headed
> on to crontab and set up hourly job to force tokens expiry.
> 
> Nevertheless, after about 11 hours the stuck happened again.
> 
> Anybody else experiencing familiar issue?
> 

I haven't seen this problem since turning autoexpire off.

Sounds like a mysql problem (either getting to or w/in). Any indications on
the mysql side?

P.S. The new qmlog has grep built in. Perhaps I should've made it egrep?
-- 
-Eric 'shubes'

---------------------------------------------------------------------
     QmailToaster hosted by: VR Hosted <http://www.vr.org>
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to