Well, I got STONITH to finally work automatically with external/ipmi settings, but I've ran into a whole new problem: it seems to work too well. What I mean by this is that when one machine comes back on from a STONITH shutdown (which I turn back on manually), it shuts the other machine down. As you can imagine, this just keeps going on forever...
I currently have my my stonith-action set to "poweroff", maybe I should set this to "reboot"? Here's my current crm configure show output: node carbon > node oxygen > primitive neon_vm ocf:heartbeat:VirtualDomain \ > params config="/etc/libvirt/qemu/neon.xml" > hypervisor="qemu:///system" \ > meta allow-migrate="true" \ > op start interval="0" timeout="120s" \ > op stop interval="0" timeout="120s" \ > op monitor interval="10" timeout="30" depth="0" > primitive nickel_vm ocf:heartbeat:VirtualDomain \ > params config="/etc/libvirt/qemu/nickel.xml" > hypervisor="qemu:///system" \ > meta allow-migrate="true" \ > op start interval="0" timeout="120s" \ > op stop interval="0" timeout="120s" \ > op monitor interval="10" timeout="30" depth="0" > primitive res_drbd_disk0 ocf:linbit:drbd \ > params drbd_resource="disk0" \ > op monitor interval="120s" \ > op start interval="0" timeout="240s" \ > op stop interval="0" timeout="100s" > primitive res_fs ocf:heartbeat:Filesystem \ > params device="/dev/drbd0" directory="/shared" fstype="ext4" \ > meta target-role="Started" > primitive res_ip ocf:heartbeat:IPaddr2 \ > params ip="192.168.10.150" cidr_netmask="24" nic="eth0" > primitive silicon_vm ocf:heartbeat:VirtualDomain \ > params config="/etc/libvirt/qemu/silicon.xml" > hypervisor="qemu:///system" \ > meta allow-migrate="true" \ > op start interval="0" timeout="120s" \ > op stop interval="0" timeout="120s" \ > op monitor interval="10" timeout="30" depth="0" > primitive st-ipmilan stonith:external/ipmi \ > params hostname="carbon" ipaddr="192.168.10.148" userid="admin" > passwd="secret" interface="lan" pcmk_host_check="static-list" > pcmk_host_list="carbon" \ > op start interval="0" timeout="60s" requires="nothing" \ > op monitor interval="3600s" timeout="20s" requires="nothing" > primitive st-ipmilan2 stonith:external/ipmi \ > params hostname="oxygen" ipaddr="192.168.10.149" userid="admin" > passwd="secret" interface="lan" pcmk_host_check="static-list" > pcmk_host_list="oxygen" \ > op start interval="0" timeout="60s" requires="nothing" \ > op monitor interval="3600s" timeout="20s" requires="nothing" > primitive zinc_vm ocf:heartbeat:VirtualDomain \ > params config="/etc/libvirt/qemu/zinc.xml" > hypervisor="qemu:///system" \ > meta allow-migrate="true" \ > op start interval="0" timeout="120s" \ > op stop interval="0" timeout="120s" \ > op monitor interval="10" timeout="30" depth="0" > group rg_disk0 res_fs res_ip silicon_vm zinc_vm neon_vm nickel_vm > ms ms_drbd_disk0 res_drbd_disk0 \ > meta notify="true" master-max="1" master-node-max="1" > clone-max="2" clone-node-max="1" > location drbd-fence-by-handler-ms_drbd_disk0 ms_drbd_disk0 \ > rule $id="drbd-fence-by-handler-rule-ms_drbd_disk0" $role="Master" > -inf: #uname ne oxygen > location pref-stonith-carbon st-ipmilan -inf: carbon > location pref-stonith-oxygen st-ipmilan2 -inf: oxygen > colocation c_disk0_on_drbd inf: rg_disk0 ms_drbd_disk0:Master > order o_drbd_before_vms inf: ms_drbd_disk0:promote rg_disk0:start > property $id="cib-bootstrap-options" \ > dc-version="1.1.6-9971ebba4494012a93c03b40a2c58ec0eb60f50c" \ > cluster-infrastructure="openais" \ > expected-quorum-votes="2" \ > stonith-enabled="true" \ > no-quorum-policy="ignore" \ > default-action-timeout="240" \ > stonith-action="poweroff" _______________________________________________ Linux-HA mailing list [email protected] http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
