Our qmail+ldap installation seems to be losing track of the number of
local deliveries going on.  We have it set up to limit concurrent
local deliveries to 50, and after running for a while, it will think
that it already has 50 local deliveries running, and won't start any
more up.  In reality, though, it has between 0 and 20 running

Here's an example:

mail013#  tail -1000 syslog | grep 'status:' | tail -10
Apr 10 14:09:32 mail013 qmail: 955390172.820586 status: local 50/50 remote 13/50
Apr 10 14:09:35 mail013 qmail: 955390175.792667 status: local 50/50 remote 12/50
Apr 10 14:09:35 mail013 qmail: 955390175.993011 status: local 50/50 remote 11/50
Apr 10 14:09:39 mail013 qmail: 955390179.265143 status: local 50/50 remote 10/50
Apr 10 14:09:40 mail013 qmail: 955390180.472890 status: local 50/50 remote 9/50
Apr 10 14:09:41 mail013 qmail: 955390181.445602 status: local 50/50 remote 8/50
Apr 10 14:09:48 mail013 qmail: 955390188.578305 status: local 50/50 remote 9/50
Apr 10 14:09:48 mail013 qmail: 955390188.705810 status: local 50/50 remote 8/50
Apr 10 14:09:53 mail013 qmail: 955390193.873759 status: local 50/50 remote 9/50
Apr 10 14:09:54 mail013 qmail: 955390194.544234 status: local 50/50 remote 10/50

mail013# ps -ef | grep qmail-local | grep -v grep |wc -l
       0
mail013# ps -ef | grep qmail-lspawn |grep -v grep |wc -l
      19

Additionally, many of the 19 qmail-lspawn processes are a week old or
older, and seem to be hung (they don't have any child processes, and
are in a poll() system call).

This can be solved by shutting down qmail and starting it back up, but
I don't want to have to do this every week for each of our 20 mail
servers.

Has anybody seen this problem before?  We are running qmail-1.03 with
the LDAP patches from 19991007.  We've got a cluster of about 20 mail
servers, each pushing around 65,000 messages/day.  I'm seeing this
problem after running for about 2 weeks.

Thanks for any pointers,

------Scott.


Reply via email to