having the issues in not-responding services of the last few days in mind i ask if there is a chance to get systemd-watchdog supported - the difference between this and the simple "Restart=always" we have configured is that "Restart=always" only takes action when the service crashed but not if it is running and no longer responding
what makes me headache ist *where* sd_notify should be triggered to catch this hanging-issues because as example a successful client connection followed by a response is not enough and would restart services if the are no users connecting and having a own thread always saying "i am alive" needs not to be the truth in the situations we covered however, it's worth to consider over the long because it would *greatly* reduce the impact of issues by get the daemon relieable restarted and have a syslog entry from systemd that it happened while services still up and running again _______________________________________________ http://www.freedesktop.org/software/systemd/man/systemd.service.html WatchdogSec= Configures the watchdog timeout for a service. The watchdog is activated when the start-up is completed. The service must call sd_notify(3) regularly with "WATCHDOG=1" (i.e. the "keep-alive ping"). If the time between two such calls is larger than the configured time then the service is placed in a failure state. By setting Restart= to on-failure or always the service will be automatically restarted. The time configured here will be passed to the executed service process in the WATCHDOG_USEC= environment variable. This allows daemons to automatically enable the keep-alive pinging logic if watchdog support is enabled for the service. If this option is used NotifyAccess= (see below) should be set to open access to the notification socket provided by systemd. If NotifyAccess= is not set, it will be implicitly set to main. Defaults to 0, which disables this feature The service must call sd_notify(3) regularly with "WATCHDOG=1" http://www.freedesktop.org/software/systemd/man/sd_notify.html
signature.asc
Description: OpenPGP digital signature
_______________________________________________ DBmail mailing list [email protected] http://mailman.fastxs.nl/cgi-bin/mailman/listinfo/dbmail
