Tony Gan wrote:
> Hi,
> For a two-node cluster, what are the best STONITH devices?
> 
> Currently I am using Dell's iDrac for STONITH device. It works pretty well.
> However the biggest problem for iDrac or any other lights-out devices is
> that they share power supply with hosts machines.
> 
> Once an active machine lost its power completely, you want to fail-over to
> the backup-node in your cluster.
> But with iDrac as your STONITH device you can not, because the STONITH
> resource on backup node will run into error (fail to connect to STONITH
> device, it's out of power too) , and refuse to start any resources.
> 
> 
> I was wondering what kind of STONITH devices everybody is using to solve
> this problem. And how much are they?
> 
> Actually Pacemaker's page have a link talking about this:
> http://www.clusterlabs.org/doc/crm_fencing.html
> 
> It suggests UPS (Uninterruptible Power Supply) as well as PDU (Power
> Distribution Unit).
> Anybody used them before? How well are they integrated with Heartbeat? What
> are the pros and cons?

Hi

I am using APC PDUs for my clusters.

The setup is like:

power supply circuit 1 -> pdu 1 -> node 1
power supply circuit 2 -> pdu 2 -> node 2

If a node fails, the corresponding pdu usually is accessible and manageable.

However, if a pdu fails (and they probably can fail in ways we cannot
really imagine (to quote Dejan)) that renders the same problem as yours.
The node is down, the stonith device is down, so no resource takeover.

But imho, this is not resolvable. At least I do not know of a way how
to. If a PDU or UPS fails (node down and power device down), then the
resources for the failed node will not be recovered since the failed
node cannot be shot.

Regards
Dominik
_______________________________________________
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Reply via email to