Hi *,

i am new to this list and currently testing a 2-node cluster with
heartbeat in vmware-server.

With dopd + fencing resource-only the cluster does not what he is
supposed to do.

In case of a complete network loss on nodeB, nodeA will not takeover the
resources because of dopd can not outdate an already dead system.


State change failed: (-7) Refusing to be Primary while peer is not outdated

I saw this patch from the author of dopd and I'm not quite shure if this
patch is now part of the current  release.


http://hg.linux-ha.org/dev/file/47f60bebe7b2/contrib/drbd-outdate-peer/dopd.c



In my test environment (vmware) i run

Ubuntu 8.0.4
with original packages of
heartbeat 2.1.3
drbd 8.0.11

=ha.cf=

debugfile /var/log/ha-debug
logfile /var/log/ha-log
deadtime 30
warntime 10
initdead 120
auto_failback on
watchdog /dev/watchdog
node    vm140
node    vm142
ping xxx.xxx.xxx.xxx
respawn hacluster /usr/lib/heartbeat/ipfail
apiauth ipfail gid=haclient uid=hacluster
deadping 5
respawn hacluster /usr/lib/heartbeat/dopd
apiauth dopd gid=haclient uid=hacluster
uuidfrom nodename
crm no

=haresources=
vm140 IPaddr::xxx.xxx.xxx.141/24/eth0  drbddisk::r0
Filesystem::/dev/drbd0::/drbd0::ext3 vm141
vm142 IPaddr::xxx.xxx.xxx.143/24/eth0  drbddisk::r1
Filesystem::/dev/drbd1::/drbd1::ext3 vm143

=drbd.conf=
global {
usage-count yes;
}
common {
syncer { rate 10M; }
}
resource r0 {
protocol C;
handlers {
pri-on-incon-degr "echo o > /proc/sysrq-trigger ; halt -f";
pri-lost-after-sb "echo o > /proc/sysrq-trigger ; halt -f";
local-io-error "echo o > /proc/sysrq-trigger ; halt -f";
outdate-peer "/usr/lib/heartbeat/drbd-peer-outdater -t 5";
}
startup {
}
disk {
on-io-error   detach;
fencing resource-only;
}
net {
after-sb-0pri disconnect;
after-sb-1pri disconnect;
after-sb-2pri disconnect;
rr-conflict disconnect;
}
syncer {
rate 10M;
al-extents 257;
}
on vm140 {
device     /dev/drbd0;
disk       /dev/sdb1;
address    10.0.0.1:7788;
meta-disk  internal;
}
on vm142 {
device    /dev/drbd0;
disk      /dev/sdb1;
address   10.0.0.2:7788;
meta-disk internal;
}
}
resource r1 {
protocol C;
handlers {
pri-on-incon-degr "echo o > /proc/sysrq-trigger ; halt -f";
pri-lost-after-sb "echo o > /proc/sysrq-trigger ; halt -f";
local-io-error "echo o > /proc/sysrq-trigger ; halt -f";
outdate-peer "/usr/lib/heartbeat/drbd-peer-outdater -t 5";
}
startup {
}
disk {
on-io-error   detach;
fencing resource-only;
}
net {
after-sb-0pri disconnect;
after-sb-1pri disconnect;
after-sb-2pri disconnect;
rr-conflict disconnect;
}
syncer {
rate 10M;
al-extents 257;
}
on vm140 {
device     /dev/drbd1;
disk       /dev/sdb2;
address    10.0.0.1:7789;
meta-disk  internal;
}
on vm142 {
device    /dev/drbd1;
disk      /dev/sdb2;
address   10.0.0.2:7789;
meta-disk internal;
}
}
-- 


Heiko Weier           Technische Universität Hamburg-Harburg

PGP Key:
http://www.tub.tu-harburg.de/keys/Heiko_Weier_pub.asc
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Reply via email to