On 05/05/2016 03:18 AM, Stephanie Daugherty wrote:
Process supervision is something I'm very opinionated about. In a
number of high-availability production environments, it's a necessary
evil.
However, it should *never* be an out-of-the-box default for any
network-exposed service. Service failures should be extraordinary
events, and we should keep treating them as such, so that we
continue to pursue stability. Restarting a service automatically
doesn't improve the stability of that software; it works around an
instability rather than addressing the root cause - it's a band-aid
over a festering wound.
The failure of a service is analogous, in my eyes, to the tripping of a
circuit breaker - it happened for a reason, and that underlying reason
is probably serious. Circuit breakers in houses generally don't reset
themselves, and neither should network-facing services.
The biggest concern in any service failure is that the failure was
caused by an exploit attempt - attacks which exploit bad memory
management tend to crash whatever they are exploiting, even on
a failed attempt. In an environment where such an event has been
reduced to routine, and automatic restarts are the norm, the attacker
gets as many attempts as they need, reducing one of the first signs of
an intrusion to barely a blip on the radar - if the systems are even
being monitored at all.
The second reason is that it will reduce the number of high-quality
bug reports developers receive - if failure is part of the routine, it
tends not to get investigated very thoroughly, if at all.
A third reason is convention and expectation. We've lived without
process supervision in the *nix world for almost four decades now, and
admins with those decades of experience generally expect to be able to
kill off a process and have it stay down.
Please consider these factors in any implementation of process
supervision - while it's certainly a needed improvement for many
organizations, it's not something that should just be on by default.
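For illustration only (this assumes a systemd-style init; the unit and binary names are hypothetical), the gap between "stay down by default" and "bounded, opt-in supervision" is a couple of directives - with no Restart= line at all, a crashed service stays down:

```ini
# exampled.service - hypothetical unit. Restart= defaults to "no",
# so without the lines below a crashed daemon stays down.
[Unit]
Description=Example network-facing daemon
# Cap restarts so a crash loop still surfaces as a hard failure:
StartLimitIntervalSec=300
StartLimitBurst=3

[Service]
ExecStart=/usr/local/bin/exampled
# Opt-in supervision: restart on failure, but after 3 failures
# within 5 minutes the unit enters the failed state and stays down.
Restart=on-failure
RestartSec=10
```

Even where an organization opts in, bounding the restarts like this preserves the "circuit breaker" property: a repeatedly crashing service eventually trips and demands attention.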
I couldn't agree more. Some systems I've administered had monitoring
daemons, but they would only warn the admin via email and not act
automatically.
When you are working with many servers, you want to have your own
monitoring - Icinga, for example. I think warning notifications by
default are a good thing.
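A notify-only default along these lines can be sketched as follows (assuming systemd and a working local `mail` command; all unit and binary names are hypothetical) - the service stays down on failure, and a one-shot unit mails the admin instead of restarting anything:

```ini
# exampled.service - no Restart= line, so a crashed daemon stays down;
# OnFailure= activates a templated notifier unit instead.
[Unit]
Description=Example daemon, notify-only on failure
OnFailure=notify-admin@%n.service

[Service]
ExecStart=/usr/local/bin/exampled

# --- notify-admin@.service (separate file) ---
# Templated one-shot mailer; %i expands to the failed unit's name.
[Unit]
Description=Mail the admin that %i failed

[Service]
Type=oneshot
ExecStart=/bin/sh -c 'echo "%i failed on $(hostname)" | mail -s "%i failed" root'
```

This keeps the human in the loop: the admin learns about the failure promptly but decides for themselves whether restarting is safe.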
On 05/05/2016 05:45 AM, Rainer Weikusat wrote:
It greatly reduces the number of "low-quality" (or rather, "no quality")
bug reports I receive as I don't (usually) get frantic phone calls at
3am UK time because a server in Texas terminated itself for some
reason. Instead, I can collect the core file as soon as I get around to
that and fix the bug.
NB: I deal with appliances (as developer) and not with servers (as
sysadmin).
So, for example, would something like daemontools be what you use with
your field-deployed software?
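For reference, daemontools-style supervision is driven by a tiny ./run script per service, which supervise re-runs whenever the daemon exits (a sketch; the daemon path and its flag are illustrative, not from any real package):

```sh
#!/bin/sh
# /service/exampled/run - supervise runs this script and, by design,
# re-runs it whenever the daemon exits; stop it with `svc -d /service/exampled`.
exec 2>&1
# Daemon must stay in the foreground so supervise can track it
# (the -f flag here is a hypothetical "don't fork" option):
exec /usr/local/bin/exampled -f
```

The relevant point for this thread is that with daemontools the restart-on-exit behavior is inherent to the tool, not an option - which is exactly the default being argued against.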
I tend to think that something like automatic restarts are the exception
rather than the rule, and so no default support needs to be provided.
I would not like to, for example, install Apache and mod_php and have it
restart after it has crashed due to a crappy PHP application. I am of
the opinion that this is a big security risk. I am sure much thought has
been spent on the subject of sane defaults for a server.
_______________________________________________
Dng mailing list
Dng@lists.dyne.org
https://mailinglists.dyne.org/cgi-bin/mailman/listinfo/dng