On 02/20/2014 02:04 PM, Lindsay Haisley wrote: > > Here's a sampling of the qrunner log from the wee hours, before I > started poking at the problem to try to fix it: > > Feb 20 03:22:02 2014 (2447) IncomingRunner qrunner caught SIGINT. Stopping. > Feb 20 03:22:02 2014 (2447) IncomingRunner qrunner exiting. > Feb 20 03:22:02 2014 (2445) BounceRunner qrunner caught SIGINT. Stopping. > Feb 20 03:22:02 2014 (2445) BounceRunner qrunner exiting. > Feb 20 03:22:02 2014 (2446) CommandRunner qrunner caught SIGINT. Stopping. > Feb 20 03:22:02 2014 (2446) CommandRunner qrunner exiting. > Feb 20 03:22:02 2014 (2451) RetryRunner qrunner caught SIGINT. Stopping. > Feb 20 03:22:02 2014 (2443) Master watcher caught SIGINT. Restarting. > Feb 20 03:22:02 2014 (2444) ArchRunner qrunner caught SIGINT. Stopping. > Feb 20 03:22:02 2014 (2444) ArchRunner qrunner exiting. > Feb 20 03:22:02 2014 (2448) NewsRunner qrunner caught SIGINT. Stopping. > Feb 20 03:22:02 2014 (2450) VirginRunner qrunner caught SIGINT. Stopping. > Feb 20 03:22:02 2014 (2449) OutgoingRunner qrunner caught SIGINT. Stopping. > Feb 20 03:22:02 2014 (2451) RetryRunner qrunner exiting. > Feb 20 03:22:02 2014 (2448) NewsRunner qrunner exiting. > Feb 20 03:22:02 2014 (2450) VirginRunner qrunner exiting. > Feb 20 03:22:02 2014 (2443) Master qrunner detected subprocess exit > (pid: 2445, sig: None, sts: 2, class: BounceRunner, slice: 1/1) [restarting] > Feb 20 03:22:02 2014 (2443) Master qrunner detected subprocess exit > (pid: 2446, sig: None, sts: 2, class: CommandRunner, slice: 1/1) [restarting] > Feb 20 03:22:02 2014 (2443) Master qrunner detected subprocess exit > (pid: 2451, sig: None, sts: 2, class: RetryRunner, slice: 1/1) [restarting] > Feb 20 03:22:02 2014 (2443) Master qrunner detected subprocess exit > (pid: 2448, sig: None, sts: 2, class: NewsRunner, slice: 1/1) [restarting] > Feb 20 03:22:02 2014 (2443) Master qrunner detected subprocess exit > (pid: 2444, sig: None, sts: 2, class: ArchRunner, slice: 1/1) [restarting] > Feb 20 03:22:02 2014 (2443) Master qrunner detected subprocess exit > (pid: 2447, sig: None, sts: 2, class: IncomingRunner, slice: 1/1) [restarting]
OK, From what you report here and elsewhere, it appears the issue was with OutgoingRunner not processing Mailman's 'out' queue. If the above log excerpt (appears to be from a mailmanctl restart) is complete, you will note that there are three entries for most runners, e,g. > Feb 20 03:22:02 2014 (2447) IncomingRunner qrunner caught SIGINT. Stopping. > Feb 20 03:22:02 2014 (2447) IncomingRunner qrunner exiting. > Feb 20 03:22:02 2014 (2443) Master qrunner detected subprocess exit > (pid: 2447, sig: None, sts: 2, class: IncomingRunner, slice: 1/1) [restarting] But there is only one for OutgoingRunner > Feb 20 03:22:02 2014 (2449) OutgoingRunner qrunner caught SIGINT. Stopping. suggesting that it was hung and never terminated. Had it been me at that point, I would have stopped Mailman and made sure it was completely stopped per the FAQ at <http://wiki.list.org/x/_4A9>, and then started it to see if that fixed the problem. If the out queue were still not being processed, I would try to trace the OutgoingRunner process to see where it was hung. -- Mark Sapiro <m...@msapiro.net> The highway is for gamblers, San Francisco Bay Area, California better use your sense - B. Dylan ------------------------------------------------------ Mailman-Users mailing list Mailman-Users@python.org https://mail.python.org/mailman/listinfo/mailman-users Mailman FAQ: http://wiki.list.org/x/AgA3 Security Policy: http://wiki.list.org/x/QIA9 Searchable Archives: http://www.mail-archive.com/mailman-users%40python.org/ Unsubscribe: https://mail.python.org/mailman/options/mailman-users/archive%40jab.org