Hi,
You have the same problem I ran into earlier:
http://lists.linux-ha.org/pipermail/linux-ha/2010-April/040035.html

I think the solution is to use a UPS or PDU as the STONITH device. The problem
is that STONITH is treated as a resource. If it fails for any reason, the
cluster will simply fail too.

Therefore, to prevent the failure in your scenario, your STONITH devices and
host machines need to have separate power supplies.
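
To make the reasoning concrete, here is a toy model in Python. It is only a
sketch of the argument, not how the cluster software works internally, and
every name in it (FenceDevice, Node, can_fence, the feed labels) is made up
for illustration. The assumption is simply that a fence device can confirm a
power-off only while the device itself still has power.

```python
from dataclasses import dataclass


@dataclass
class FenceDevice:
    name: str
    power_feed: str  # hypothetical label for the feed that powers the device


@dataclass
class Node:
    name: str
    power_feed: str
    fence_device: FenceDevice


def can_fence(node, dead_feeds):
    """Assumption: fencing is confirmed only if the fence device has power."""
    return node.fence_device.power_feed not in dead_feeds


def may_take_over(peer, dead_feeds):
    """The surviving node takes over only after the peer has been fenced."""
    return can_fence(peer, dead_feeds)


# An on-board controller shares the server's power feed; a PDU does not.
idrac_node = Node("node1", "feed-node1", FenceDevice("idrac", "feed-node1"))
pdu_node = Node("node1", "feed-node1", FenceDevice("pdu", "feed-pdu"))

dead = {"feed-node1"}  # node1 has been unplugged
print(may_take_over(idrac_node, dead))  # False
print(may_take_over(pdu_node, dead))    # True
```

With the on-board device the fence attempt cannot succeed once the node loses
power, so the survivor never takes over. With a separately powered device the
same outage is still fenceable.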


Cheers
Tony

On Tue, Apr 6, 2010 at 1:36 PM, Greg Woods <[email protected]> wrote:

> I'm looking for a good way to deal with the "total power drop" case. I
> am using an iDrac 6 as a stonith device on a pair of Dell R710 servers.
> I tried the power drop test today by simply unplugging the power on one
> of the nodes. What happens in this case is that the attempt by the other
> node to stonith the dead node fails, so the other node refuses to take
> over resources.
>
> Since this is a fairly rare scenario (the machines have dual power
> supplies and use the same pair of power circuits, so the chances that
> one node completely loses power and the other doesn't are almost
> nonexistent; the way this could happen is a human accidentally powering
> off the wrong machine), I'd be willing to deal with it in manual mode as
> long as it can be done remotely. Is there any way to manually fence a
> node that I know is dead? I.e. to tell the still-running node "I know
> the other node is dead even though you can't stonith it, please pretend
> the stonith succeeded and take over resources"?
>
> Thanks,
> --Greg
>
>
> _______________________________________________
> Linux-HA mailing list
> [email protected]
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems
>