On 8/21/07, sebastien lorandel <[EMAIL PROTECTED]> wrote: > hi, > > I descovered that stonith needed some hardware, in the first time I thought > it was only a piece of software...
For testing purposes there is also a ssh stonith agent available. > But > 1 - I also read that it is not imperative, so when does it become required? > I am installing a two node cluster managing ssh, ip address and conntrackd. Whenever there is data shared between nodes stonith is highly recommended to avoid data corruption ... e.g. when two nodes try to access the same file system. In case of a split brain situation where a cluster is split into subclusters with equal node count stonith is a way to regain quorum by resetting one node. The subcluster who 'wins the race' to stonith a node in the concurrent subcluster takes over all resources. A two node heartbeat cluster is a special case because it has always quorum to allow one single node to run resources. In case of a cluster with an uneven node count you could also rely on quorum only but the safest way is stonith. In worst case you have two nodes running the same ip/ssh/conntrackd resources at the same time in your two node cluster if such a split brain situation occours and you don't have stonith configured. > 2 - And I also don't understand why we need hardware, why isn't it directly > implemented in Heartbeat telling a node it should restart? How should heartbeat decide if a node is down or if all communication paths are unavailble to contact the oder node? If a stonith action is successfull it is save to decide a node is really dead and you can be shure it has no resources running. There are e.g. management facilities with an extra network port available to allow remote restarts of a server in case the server is completely unresponsive or you can use manageable UPS facilities. > 3 - And then my last question, how can we know if a switch is a STONIH one? see above Regards, Andreas > > Thanks in advance, maybe these questions can seem stupid to some of you but > I didn't saw answer to them in the mailing list and the website. > > -- > Sébastien Lorandel > _______________________________________________ > Linux-HA mailing list > [email protected] > http://lists.linux-ha.org/mailman/listinfo/linux-ha > See also: http://linux-ha.org/ReportingProblems > _______________________________________________ Linux-HA mailing list [email protected] http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
