Re: [gentoo-user] Re: monit and friends.

2017-10-18 Thread skyclan
Hi Alan, This isn't exactly what you describe for your needs but have you considered using auto-remediation outside of the box? I've been using StackStorm https://stackstorm.com/ for the last year in an environment of ~1500 physical servers for this purpose and it's been quite successful.

Re: [gentoo-user] Re: monit and friends.

2017-10-16 Thread Michael Orlitzky
On 10/16/2017 11:50 AM, Alan McKinnon wrote: > > What I need here is a small app that will be a constrained, > single-purpose watchdog. If a daemon fails, the watchdog attempts 3 > restarts to get it going, and records the fact it did it (that goes into > the big monitoring system as a reportable

Re: [gentoo-user] Re: monit and friends.

2017-10-16 Thread Alan McKinnon
On 16/10/2017 18:10, Ralph Seichter wrote: > On 16.10.2017 17:50, Alan McKinnon wrote: > >> Nagios and I go way back, way way waay back. I now recommend it >> never be used unless there really is no other option. > > Have you tried Icinga 2 (*) yet? It originally started as a Nagios fork >

Re: [gentoo-user] Re: monit and friends.

2017-10-16 Thread Ralph Seichter
On 16.10.2017 17:50, Alan McKinnon wrote: > Nagios and I go way back, way way waay back. I now recommend it > never be used unless there really is no other option. Have you tried Icinga 2 (*) yet? It originally started as a Nagios fork and uses plugins to monitor, but the rule-based

Re: [gentoo-user] Re: monit and friends.

2017-10-16 Thread Alan McKinnon
On 16/10/2017 17:41, Mick wrote: > On Monday, 16 October 2017 16:12:53 BST Alan McKinnon wrote: >> On 16/10/2017 17:08, Ian Zimmerman wrote: >>> On 2017-10-16 14:11, Alan McKinnon wrote: My needs here are pretty simple: local watchdog that checks if a program is running and restart it if

Re: [gentoo-user] Re: monit and friends.

2017-10-16 Thread Mick
On Monday, 16 October 2017 16:12:53 BST Alan McKinnon wrote: > On 16/10/2017 17:08, Ian Zimmerman wrote: > > On 2017-10-16 14:11, Alan McKinnon wrote: > >> My needs here are pretty simple: > >> local watchdog that checks if a program is running and restart it if > >> not. If that fails 3 times or

Re: [gentoo-user] Re: monit and friends.

2017-10-16 Thread Alan McKinnon
On 16/10/2017 17:08, Ian Zimmerman wrote: > On 2017-10-16 14:11, Alan McKinnon wrote: > >> My needs here are pretty simple: >> local watchdog that checks if a program is running and restart it if >> not. If that fails 3 times or so, alert me. >> Maybe a few file/dir/fifo monitors as well. Not