Jordan Brown (Sun) writes:
> I know you tried to steer us away from the "restart" question, but I 
> think that remains relevant here.

I'm gonna keep on doing that, because the original problem that we
were discussing (before this thread veered out of control) would have
been NO DIFFERENT if SMF had done no restart at all.

It was the sending of the uncaught (and unexpected) signal to the
parent that was the issue, not any restart.

> In either event, I don't think the goal is to assign blame for the failure.

The goal of sending a termination signal to some subset of all of the
processes on the signal is to impute "blame."  Those processes are the
ones "damaged" by the failure and that are sharing fate.

It's the fact that this is a _new_ behavior and that it's really not
well-understood that's at issue.

> > Still, even in a development environment, I don't want ifconfig(1M)
> > (or its invoker) to take a signal if dhcpagent or in.mpathd dies, or
> > if dhcpagent's event hook script (from the user) drops core.  In those
> > cases, that'll happen today, and it's not right.
> 
> I'm not so sure.  If those things fail, I *want* the issue raised.  In 
> my more draconian moments, I want it raised so loudly that it cannot be 
> ignored, so that it _must_ be addressed.  (Plus, in a development 
> environment, doesn't your "all in the same playground" comment apply?)

I want _an_ issue raised when those problems occur, regardless of
environment.

However, it's not the _fault_ of ifconfig or its invoker if anything
goes wrong with these other processes.

That's the point I'm trying -- very hard -- to get across.

-- 
James Carlson, Solaris Networking              <james.d.carlson at sun.com>
Sun Microsystems / 35 Network Drive        71.232W   Vox +1 781 442 2084
MS UBUR02-212 / Burlington MA 01803-2757   42.496N   Fax +1 781 442 1677

Reply via email to