On Tue, Apr 08, 2025 at 01:13:25PM -0400, Greg Troxel wrote: > I would write a script to run ps alxw once a minute and save it to files > named by date +%s, and see if that results in interesting values in > wchan on the next incident.
Finally been lucking doing that (by sending the "ps alxw" output every minute to a remote system). What I am seeing is that processes end up waiting in "tstile" (just showing those). It started at Wed Aug 20 01:39:03 UTC 2025 with: 1001 1066 1 0 117 0 39632 6748 tstile D ? 0:09.01 /usr/pkg/sbin/lighttpd -f /usr/pkg/etc/lighttpd/lighttpd.conf Wed Aug 20 01:40:03 UTC 2025 0 603 1 0 117 0 36252 2540 tstile Ds ? 0:03.45 /usr/sbin/syslogd -s 1001 1066 1 0 117 0 39632 6748 tstile D ? 0:09.01 /usr/pkg/sbin/lighttpd -f /usr/pkg/etc/lighttpd/lighttpd.conf continuing until 2 am when we get a newsyslog process: Wed Aug 20 02:00:04 UTC 2025 0 603 1 0 117 0 36252 2540 tstile Ds ? 0:03.45 /usr/sbin/syslogd -s 1001 1066 1 0 117 0 39632 6748 tstile D ? 0:09.01 /usr/pkg/sbin/lighttpd -f /usr/pkg/etc/lighttpd/lighttpd.conf 0 16952 7133 0 117 0 15760 1612 tstile Ds ? 0:00.00 /usr/bin/newsyslog Christof -- https://cmeerw.org sip:cmeerw at cmeerw.org mailto:cmeerw at cmeerw.org xmpp:cmeerw at cmeerw.org