Re: [DNG] Supervision scripts (was Re: OpenRC and Devuan)

Arnt Gulbrandsen Wed, 04 May 2016 13:42:45 -0700

Stephanie Daugherty writes:

Service failures should be extraordinary events, and we shouldstrive to keep treating them as such, so that we continue topursue stability. Restarting a service automatically doesn'timprove stability of that software, it works around aninstability rather than addressing the root cause - it's aband-aid over a festering wound.

Unix has a few design choices that tend to produce problems like these,such as malloc() and its c++ cousin "operator new".

Malloc() is very simple: You ask for memory and get it. The negative sideof that simplicity is that if you're out of memory (and that happensoccasionally if a server is run close to capacity) then processes dieand/or become unresponsive. Such is the tyranny of the Poissondistribution.

The failure of a service is analogous in my eyes to thetripping of a circuit breaker - it happened for a reason, andthat underlying reason is probably serious.

Pick your poison: Restart services or add failure handling around allmalloc() calls. I quite like the former in many cases, even though itpapers over various unintentional problem as well as provide theintentional simplification. But then I like TCP better than NCP, etc.


Arnt

_______________________________________________
Dng mailing list
[email protected]
https://mailinglists.dyne.org/cgi-bin/mailman/listinfo/dng

Re: [DNG] Supervision scripts (was Re: OpenRC and Devuan)

Reply via email to