On Mon, Sep 08, 2008 at 03:27:31PM -0400, Wietse Venema wrote:
> Devdas Bhagat:
> > The last error messages I get are these:
> > Sep 8 13:54:37 jaundiced-outlook postfix/smtp[7998]: warning: problem
> > talking to service private/scache: Connection timed out
> > Sep 8 13:54:37 jaundiced-outlook postfix/smtp[20375]: warning: problem
> > talking to service private/scache: Connection timed out
> > Sep 8 13:54:37 jaundiced-outlook postfix/smtp[7960]: warning: problem
> > talking to service private/scache: Connection timed out
> > Sep 8 13:54:37 jaundiced-outlook postfix/smtp[17618]: warning: problem
> > talking to service private/scache: Connection timed out
> > <snip about 600 similar lines about this problem>
> > Sep 8 14:10:56 jaundiced-outlook postfix/master[11125]: fatal: watchdog
> > timeout
> > Sep 8 14:10:56 jaundiced-outlook postfix/qmgr[13568]: fatal: watchdog
> > timeout
>
> I think that the kernel is running out of steam.
>
> Try reducing the concurrency.
>
> The master daemon triggers qmgr and pickup regularly. That "trigger"
> write is non-blocking with a timeout of 1, so it cannot block the
> master daemon. Except of course when the kernel is messed up.

Hmm, this is
Linux 2.6.9-67.0.1.EL #1 Fri Nov 30 11:41:37 EST 2007 x86_64 x86_64 x86_64 GNU/Linux
on a RHEL 4 box.

I'll lower the concurrency and see if the system stabilizes.

Devdas Bhagat
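
For readers following the thread, the idea behind the "trigger" write quoted above (a non-blocking write bounded by a short timeout, so the writer cannot hang on a stuck peer) can be sketched roughly as below. This is not Postfix's code, which is written in C. The function name trigger_write, the one-byte payload, and reading "a timeout of 1" as one second are all assumptions made for illustration.

```python
import os
import select


def trigger_write(fd, data=b"W", timeout=1.0):
    """Try to write `data` to `fd`, waiting at most `timeout` seconds.

    Hypothetical helper, not a Postfix API. Returns True if the whole
    payload was written, False if the descriptor never became writable
    or the write could not complete.
    """
    # Put the descriptor in non-blocking mode so write() itself never waits.
    os.set_blocking(fd, False)

    # Wait (bounded by `timeout`) until the kernel says the fd is writable.
    _, writable, _ = select.select([], [fd], [], timeout)
    if not writable:
        return False

    try:
        return os.write(fd, data) == len(data)
    except BlockingIOError:
        # The buffer filled up between select() and write(); give up.
        return False


if __name__ == "__main__":
    # Demo against a local pipe standing in for the service's trigger channel.
    read_end, write_end = os.pipe()
    print("trigger delivered:", trigger_write(write_end))
    os.close(read_end)
    os.close(write_end)
```

In this sketch the caller always gets control back within roughly the timeout, which matches the point that the write should not be able to block the master daemon. A watchdog firing anyway would then point at the kernel rather than at the write itself.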
