On 2013-02-21T08:39:52, Andrew Beekhof <[email protected]> wrote:

> I think what he's suggesting is that the agent might be trying to do a
> reboot but some part of that process stalls/aborts/times-out and only
> the off part happens.

That'd be a kernel or firmware bug though. sbd reboots or powers off by
writing to /proc/sysrq; reboot is not a two-phase operation.

(Or, if everything really blows up, the system is rebooted by the
hardware watchdog kicking in. But I've never actually seen a system do
that in the field; because there's typically multiple SBD processes
watching each other (the master and the children), the only way to get
there for testing is to kill all SBD processes hard at the very same
time, without even leaving a gap for the signal delivery of process
death to the others ...)


Regards,
    Lars

-- 
Architect Storage/HA
SUSE LINUX Products GmbH, GF: Jeff Hawn, Jennifer Guild, Felix Imendörffer, HRB 
21284 (AG Nürnberg)
"Experience is the name everyone gives to their mistakes." -- Oscar Wilde

_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Reply via email to