On Fri, Feb 22, 2013 at 2:05 AM, Lars Marowsky-Bree <[email protected]> wrote:
> On 2013-02-21T08:39:52, Andrew Beekhof <[email protected]> wrote:
>
>> I think what he's suggesting is that the agent might be trying to do a
>> reboot but some part of that process stalls/aborts/times-out and only
>> the off part happens.
>
> That'd be a kernel or firmware bug though. sbd reboots or powers off by
> writing to /proc/sysrq; reboot is not a two-phase operation.

Fair enough. I was just going on his description, not suggesting where
the fault was.

>
> (Or, if everything really blows up, the system is rebooted by the
> hardware watchdog kicking in. But I've never actually seen a system do
> that in the field; because there's typically multiple SBD processes
> watching each other (the master and the children), the only way to get
> there for testing is to kill all SBD processes hard at the very same
> time, without even leaving a gap for the signal delivery of process
> death to the others ...)
>
>
> Regards,
>     Lars
>
> --
> Architect Storage/HA
> SUSE LINUX Products GmbH, GF: Jeff Hawn, Jennifer Guild, Felix Imendörffer, 
> HRB 21284 (AG Nürnberg)
> "Experience is the name everyone gives to their mistakes." -- Oscar Wilde
>
> _______________________________________________
> Linux-HA mailing list
> [email protected]
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Reply via email to