------ Original Message ------
From: "Jeremy Harris via Exim-users" <[email protected]>
To: [email protected]
Sent: 04.02.2022 18:35:14
Subject: Re: [exim] stuck exim processes

On 04/02/2022 09:53, Martin Waschbüsch via Exim-users wrote:
this process is stuck

Any and all hints are appreciated!

Strace the process.  Is it doing anything?  Waiting on network i/o?
Spinning?

Got another message stuck this way:

root@relay01:~# ps ax | grep exim
  807  -  Ss       0:07.01 /usr/local/sbin/exim -bd -q30m
35680  -  S        0:00.01 /usr/local/sbin/exim -Mc 1nGzzC-0009HT-3F
35685  -  I        0:00.03 /usr/local/sbin/exim -Mc 1nGzzC-0009HT-3F
36493  -  I        0:00.03 /usr/local/sbin/exim -bd -q30m

I only have truss, not strace:

root@relay01:~# truss -p 35680
wait4(-1,{ STOPPED,sig=127 },WNOHANG,0x0)    = 0 (0x0)

root@relay01:~# truss -p 35685

after stopping exim and killing the processes, I restarted exim.
This time (did not happen the last time), the message got frozen:

2022-02-07 09:58:25.414 [35680] 1nGzzC-0009HT-3F Delivery status for [email protected]: got 0 of 7 bytes (pipeheader) from transport process 35685 for transport smtp 2022-02-07 09:58:25.417 [35680] 1nGzzC-0009HT-3F == [email protected] R=dnslookup T=remote_smtp defer (-1) DT=0.000s: smtp transport process returned non-zero status 0x0009: terminated by signal 9
2022-02-07 09:58:25.417 [35680] 1nGzzC-0009HT-3F Frozen
2022-02-07 09:58:41.525 [36659] 1nGzzC-0009HT-3F Message is frozen

forcing a queue run (and thawing) by exim -qff -v resulted in:

root@relay01:~# exim -qff -v
LOG: MAIN
  cwd=/root 3 args: exim -qff -v
LOG: queue_run MAIN
  Start queue run: pid=36669 -qff
delivering 1nGzzC-0009HT-3F (queue run pid 36669)
LOG: MAIN
  Unfrozen by forced delivery
LOG: MAIN
  Completed QT=52m15s
LOG: queue_run MAIN
  End queue run: pid=36669 -qff

and in the log:

2022-02-07 09:59:17.864 [36671] 1nGzzC-0009HT-3F Unfrozen by forced delivery
2022-02-07 09:59:17.866 [36671] 1nGzzC-0009HT-3F Completed QT=52m15s

No delivery attempt in the log.


What does "mailq" say (after you kill the process) - is the problem
message still on the queue?  Is that recipient still undelivered?

If so, run a manual delivery with debug:
  exim -d+all -M <message-id>  2>&1 | tee my_debug_logfile

I will try the manual delivery with debug the next time around, but maybe this is helpful already in narrowing in on what is happening. Also: Going through the logs, it does not always occurr in combination with a TLS error, so for now I think that part is just a coincidence.


How far does it get?
-- Cheers,
  Jeremy

-- ## List details at https://lists.exim.org/mailman/listinfo/exim-users
## Exim details at http://www.exim.org/
## Please use the Wiki with this list - http://wiki.exim.org/


--
## List details at https://lists.exim.org/mailman/listinfo/exim-users
## Exim details at http://www.exim.org/
## Please use the Wiki with this list - http://wiki.exim.org/

Reply via email to