Hi *,

I am new to this list and am currently testing a 2-node cluster with heartbeat in vmware-server.
With dopd and "fencing resource-only", the cluster does not do what it is supposed to do. In case of a complete network loss on nodeB, nodeA will not take over the resources, because dopd cannot outdate a system that is already dead:

  State change failed: (-7) Refusing to be Primary while peer is not outdated

I saw this patch from the author of dopd, and I'm not quite sure whether it is part of the current release:

  http://hg.linux-ha.org/dev/file/47f60bebe7b2/contrib/drbd-outdate-peer/dopd.c

In my test environment (vmware) I run Ubuntu 8.04 with the original packages of

  heartbeat 2.1.3
  drbd 8.0.11

=ha.cf=
debugfile /var/log/ha-debug
logfile /var/log/ha-log
deadtime 30
warntime 10
initdead 120
auto_failback on
watchdog /dev/watchdog
node vm140
node vm142
ping xxx.xxx.xxx.xxx
respawn hacluster /usr/lib/heartbeat/ipfail
apiauth ipfail gid=haclient uid=hacluster
deadping 5
respawn hacluster /usr/lib/heartbeat/dopd
apiauth dopd gid=haclient uid=hacluster
uuidfrom nodename
crm no

=haresources=
vm140 IPaddr::xxx.xxx.xxx.141/24/eth0 drbddisk::r0 Filesystem::/dev/drbd0::/drbd0::ext3 vm141
vm142 IPaddr::xxx.xxx.xxx.143/24/eth0 drbddisk::r1 Filesystem::/dev/drbd1::/drbd1::ext3 vm143

=drbd.conf=
global {
    usage-count yes;
}

common {
    syncer { rate 10M; }
}

resource r0 {
    protocol C;

    handlers {
        pri-on-incon-degr "echo o > /proc/sysrq-trigger ; halt -f";
        pri-lost-after-sb "echo o > /proc/sysrq-trigger ; halt -f";
        local-io-error "echo o > /proc/sysrq-trigger ; halt -f";
        outdate-peer "/usr/lib/heartbeat/drbd-peer-outdater -t 5";
    }

    startup {
    }

    disk {
        on-io-error detach;
        fencing resource-only;
    }

    net {
        after-sb-0pri disconnect;
        after-sb-1pri disconnect;
        after-sb-2pri disconnect;
        rr-conflict disconnect;
    }

    syncer {
        rate 10M;
        al-extents 257;
    }

    on vm140 {
        device /dev/drbd0;
        disk /dev/sdb1;
        address 10.0.0.1:7788;
        meta-disk internal;
    }

    on vm142 {
        device /dev/drbd0;
        disk /dev/sdb1;
        address 10.0.0.2:7788;
        meta-disk internal;
    }
}

resource r1 {
    protocol C;

    handlers {
        pri-on-incon-degr "echo o > /proc/sysrq-trigger ; halt -f";
        pri-lost-after-sb "echo o > /proc/sysrq-trigger ; halt -f";
        local-io-error "echo o > /proc/sysrq-trigger ; halt -f";
        outdate-peer "/usr/lib/heartbeat/drbd-peer-outdater -t 5";
    }

    startup {
    }

    disk {
        on-io-error detach;
        fencing resource-only;
    }

    net {
        after-sb-0pri disconnect;
        after-sb-1pri disconnect;
        after-sb-2pri disconnect;
        rr-conflict disconnect;
    }

    syncer {
        rate 10M;
        al-extents 257;
    }

    on vm140 {
        device /dev/drbd1;
        disk /dev/sdb2;
        address 10.0.0.1:7789;
        meta-disk internal;
    }

    on vm142 {
        device /dev/drbd1;
        disk /dev/sdb2;
        address 10.0.0.2:7789;
        meta-disk internal;
    }
}

-- 
Heiko Weier
Technische Universität Hamburg-Harburg
PGP Key: http://www.tub.tu-harburg.de/keys/Heiko_Weier_pub.asc
