On Sat, Nov 21, 2009 at 2:52 AM, Michel Rode <[email protected]> wrote: > Hi all, > > i have a little problem with my ssh-stonith-device (yes i know that is > only for testing...) > > To simluate a failure, i kill the openais daemon (and all the rest) on > node2. Then node1 is trying to reset node2, but unfortunately > unsuccessful :(
make sure atd is installed and started at boot. also make sure that the nodes can ssh to one-another _without_ a password (ie. a public key that requires no password). > > node2:~ # rpm -qa | grep -E "pace|openais" > libpacemaker3-1.0.3-4.1 > pacemaker-1.0.3-4.1 > libopenais2-0.80.3-26.1 > openais-0.80.3-26.1 > > on a sles11 with hae. > > Here is the entry from log for stonith: > > Nov 21 02:35:35 node1 stonithd: [3952]: info: client tengine [pid: 3957] > requests a STONITH operation RESET on node node2 > Nov 21 02:35:35 node1 stonithd: [3952]: info: we can't manage node2, > broadcast request to other nodes > Nov 21 02:35:35 node1 stonithd: [3952]: info: Broadcasting the message > succeeded: require others to stonith node node2. > > Configuration: > > # crm configure show > [...] > primitive st-ssh stonith:external/ssh \ > params hostlist="node1-intern node2-intern" > clone fencing st-ssh > property $id="cib-bootstrap-options" \ > dc-version="1.0.3-0080ec086ae9c20ad5c4c3562000c0ad68374f0a" \ > expected-quorum-votes="2" \ > stonith-enabled="true" \ > no-quorum-policy="ignore" \ > last-lrm-refresh="1258769996" > > > When i'm doing this on the console with > > stonith -t ssh -p "node1-intern node2-intern" -T reset node2-intern > > all works fine! So i hope someone can help me. > > Thanks! > > Michel > > _______________________________________________ > Linux-HA mailing list > [email protected] > http://lists.linux-ha.org/mailman/listinfo/linux-ha > See also: http://linux-ha.org/ReportingProblems > _______________________________________________ Linux-HA mailing list [email protected] http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
