>> --drbd1 configs--
>> /etc/heartbeat/haresources:
>> drbd1 IPaddr::192.168.15.24/24/eth0/192.168.15.255
>> drbddisk::Filesystem::/dev/drbd0::/data::ext3
> ^^^^
>
> Hmm, you need to tell hb what to do
>
> try something like:
>
> drbd1 IPaddr::192.168.15.24/24/eth0 drbddisk::drbddisk
> Filesystem::/dev/drbd0::/data::ext3 nfs-kernel-server
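For anyone following along, here is a minimal sketch of how the two haresources lines get split up. It assumes heartbeat treats whitespace as the separator between resources and `::` as the separator between a resource script and its arguments, which matches the "Running /etc/ha.d/resource.d/drbddisk Filesystem /dev/drbd0 /data ext3 start" line in the log further down. The function names are made up for illustration, and the sketch ignores details such as where heartbeat looks up each script.

```python
# Illustrative only: mimics how a heartbeat v1 haresources entry appears to be
# tokenized, judging by the ResourceManager log lines. Not heartbeat's own code.

ORIGINAL = ("drbd1 IPaddr::192.168.15.24/24/eth0/192.168.15.255 "
            "drbddisk::Filesystem::/dev/drbd0::/data::ext3")
SUGGESTED = ("drbd1 IPaddr::192.168.15.24/24/eth0 drbddisk::drbddisk "
             "Filesystem::/dev/drbd0::/data::ext3 nfs-kernel-server")


def parse_haresources_line(line):
    """Return (node, [(script, [args...]), ...]) for one haresources line."""
    node, *resources = line.split()       # whitespace separates resources
    parsed = []
    for res in resources:
        script, *args = res.split("::")   # '::' separates script and arguments
        parsed.append((script, args))
    return node, parsed


def start_argv(line):
    """Render each resource as the 'script args... start' call it turns into."""
    _, parsed = parse_haresources_line(line)
    return [" ".join([script, *args, "start"]) for script, args in parsed]


if __name__ == "__main__":
    for label, line in (("original", ORIGINAL), ("suggested", SUGGESTED)):
        print(label)
        for call in start_argv(line):
            print("   ", call)
```

With the original line, the second call comes out as `drbddisk Filesystem /dev/drbd0 /data ext3 start`, so "Filesystem" and its arguments are handed to the drbddisk script instead of being run as a resource of their own. With the suggested line, drbddisk and Filesystem become two separate resources.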
Thanks Thomas,

No dice still. I've got more logs and more info, though.

I start a ping from the NFS client to the shared heartbeat address. Then, on node 2 of the heartbeat cluster, I issue '/etc/init.d/heartbeat stop'. Tailing the logs on node 1, I get the following.

--watch for inline comments--

heartbeat[6755]: 2008/12/11_13:43:39 WARN: No reply to standby request. Standby request cancelled.
heartbeat[6755]: 2008/12/11_13:44:39 info: Heartbeat restart on node drbd2
heartbeat[6755]: 2008/12/11_13:44:39 info: Link drbd2:eth0 up.
heartbeat[6755]: 2008/12/11_13:44:39 info: Status update for node drbd2: status init
heartbeat[6755]: 2008/12/11_13:44:39 info: Status update for node drbd2: status up
harc[9696]: 2008/12/11_13:44:39 info: Running /etc/ha.d/rc.d/status status
harc[9705]: 2008/12/11_13:44:39 info: Running /etc/ha.d/rc.d/status status
heartbeat[6755]: 2008/12/11_13:44:40 info: Status update for node drbd2: status active
harc[9714]: 2008/12/11_13:44:40 info: Running /etc/ha.d/rc.d/status status
heartbeat[6755]: 2008/12/11_13:44:40 info: remote resource transition completed.
heartbeat[6755]: 2008/12/11_13:45:04 info: Received shutdown notice from 'drbd2'.
heartbeat[6755]: 2008/12/11_13:45:04 info: Resources being acquired from drbd2.
harc[9724]: 2008/12/11_13:45:04 info: Running /etc/ha.d/rc.d/status status
mach_down[9743]: 2008/12/11_13:45:04 info: /usr/lib/heartbeat/mach_down: nice_failback: foreign resources acquired
mach_down[9743]: 2008/12/11_13:45:04 info: mach_down takeover complete for node drbd2.
heartbeat[6755]: 2008/12/11_13:45:04 info: mach_down takeover complete.
IPaddr[9779]: 2008/12/11_13:45:04 INFO: IPaddr Resource is stopped
heartbeat[9725]: 2008/12/11_13:45:04 info: Local Resource acquisition completed.
harc[9881]: 2008/12/11_13:45:04 info: Running /etc/ha.d/rc.d/ip-request-resp ip-request-resp
ip-request-resp[9881]: 2008/12/11_13:45:04 received ip-request-resp IPaddr::192.168.15.24/24/eth0/192.168.15.255 OK yes
ResourceManager[9894]: 2008/12/11_13:45:04 info: Acquiring resource group: drbd1 IPaddr::192.168.15.24/24/eth0/192.168.15.255 drbddisk::Filesystem::/dev/drbd0::/data::ext3
IPaddr[9917]: 2008/12/11_13:45:05 INFO: IPaddr Resource is stopped
ResourceManager[9894]: 2008/12/11_13:45:05 info: Running /etc/ha.d/resource.d/IPaddr 192.168.15.24/24/eth0/192.168.15.255 start
---this is where pings start responding to the nfs client and I can mount the nfs share--
IPaddr[10112]: 2008/12/11_13:45:05 INFO: eval /sbin/ifconfig eth0:0 192.168.15.24 netmask 255.255.255.0 broadcast 192.168.15.255
IPaddr[10112]: 2008/12/11_13:45:05 INFO: Sending Gratuitous Arp for 192.168.15.24 on eth0:0 [eth0]
IPaddr[10112]: 2008/12/11_13:45:05 INFO: /usr/lib/heartbeat/send_arp -i 500 -r 10 -p /var/run/heartbeat/rsctmp/send_arp/send_arp-192.168.15.24 eth0 192.168.15.24 auto 192.168.15.24 ffffffffffff
IPaddr[10030]: 2008/12/11_13:45:05 INFO: IPaddr Success
ResourceManager[9894]: 2008/12/11_13:45:05 info: Running /etc/ha.d/resource.d/drbddisk Filesystem /dev/drbd0 /data ext3 start
ResourceManager[9894]: 2008/12/11_13:45:05 ERROR: Return code 1 from /etc/ha.d/resource.d/drbddisk
ResourceManager[9894]: 2008/12/11_13:45:05 CRIT: Giving up resources due to failure of drbddisk::Filesystem::/dev/drbd0::/data::ext3
ResourceManager[9894]: 2008/12/11_13:45:05 info: Releasing resource group: drbd1 IPaddr::192.168.15.24/24/eth0/192.168.15.255 drbddisk::Filesystem::/dev/drbd0::/data::ext3
ResourceManager[9894]: 2008/12/11_13:45:05 info: Running /etc/ha.d/resource.d/drbddisk Filesystem /dev/drbd0 /data ext3 stop
ResourceManager[9894]: 2008/12/11_13:45:05 ERROR: Return code 1 from /etc/ha.d/resource.d/drbddisk
ResourceManager[9894]: 2008/12/11_13:45:06 info: Retrying failed stop operation [drbddisk::Filesystem::/dev/drbd0::/data::ext3]
ResourceManager[9894]: 2008/12/11_13:45:06 info: Running /etc/ha.d/resource.d/drbddisk Filesystem /dev/drbd0 /data ext3 stop
ResourceManager[9894]: 2008/12/11_13:45:06 ERROR: Return code 1 from /etc/ha.d/resource.d/drbddisk
ResourceManager[9894]: 2008/12/11_13:45:07 info: Retrying failed stop operation [drbddisk::Filesystem::/dev/drbd0::/data::ext3]
ResourceManager[9894]: 2008/12/11_13:45:07 info: Running /etc/ha.d/resource.d/drbddisk Filesystem /dev/drbd0 /data ext3 stop
ResourceManager[9894]: 2008/12/11_13:45:07 ERROR: Return code 1 from /etc/ha.d/resource.d/drbddisk
ResourceManager[9894]: 2008/12/11_13:45:08 info: Retrying failed stop operation [drbddisk::Filesystem::/dev/drbd0::/data::ext3]
ResourceManager[9894]: 2008/12/11_13:45:08 info: Running /etc/ha.d/resource.d/drbddisk Filesystem /dev/drbd0 /data ext3 stop
ResourceManager[9894]: 2008/12/11_13:45:08 ERROR: Return code 1 from /etc/ha.d/resource.d/drbddisk
ResourceManager[9894]: 2008/12/11_13:45:09 info: Retrying failed stop operation [drbddisk::Filesystem::/dev/drbd0::/data::ext3]
ResourceManager[9894]: 2008/12/11_13:45:09 info: Running /etc/ha.d/resource.d/drbddisk Filesystem /dev/drbd0 /data ext3 stop
ResourceManager[9894]: 2008/12/11_13:45:09 ERROR: Return code 1 from /etc/ha.d/resource.d/drbddisk
ResourceManager[9894]: 2008/12/11_13:45:10 info: Retrying failed stop operation [drbddisk::Filesystem::/dev/drbd0::/data::ext3]
ResourceManager[9894]: 2008/12/11_13:45:10 info: Running /etc/ha.d/resource.d/drbddisk Filesystem /dev/drbd0 /data ext3 stop
ResourceManager[9894]: 2008/12/11_13:45:10 ERROR: Return code 1 from /etc/ha.d/resource.d/drbddisk
ResourceManager[9894]: 2008/12/11_13:45:11 info: Retrying failed stop operation [drbddisk::Filesystem::/dev/drbd0::/data::ext3]
ResourceManager[9894]: 2008/12/11_13:45:11 info: Running /etc/ha.d/resource.d/drbddisk Filesystem /dev/drbd0 /data ext3 stop
ResourceManager[9894]: 2008/12/11_13:45:11 ERROR: Return code 1 from /etc/ha.d/resource.d/drbddisk
ResourceManager[9894]: 2008/12/11_13:45:12 info: Retrying failed stop operation [drbddisk::Filesystem::/dev/drbd0::/data::ext3]
ResourceManager[9894]: 2008/12/11_13:45:12 info: Running /etc/ha.d/resource.d/drbddisk Filesystem /dev/drbd0 /data ext3 stop
ResourceManager[9894]: 2008/12/11_13:45:12 ERROR: Return code 1 from /etc/ha.d/resource.d/drbddisk
ResourceManager[9894]: 2008/12/11_13:45:13 info: Retrying failed stop operation [drbddisk::Filesystem::/dev/drbd0::/data::ext3]
ResourceManager[9894]: 2008/12/11_13:45:13 info: Running /etc/ha.d/resource.d/drbddisk Filesystem /dev/drbd0 /data ext3 stop
ResourceManager[9894]: 2008/12/11_13:45:13 ERROR: Return code 1 from /etc/ha.d/resource.d/drbddisk
ResourceManager[9894]: 2008/12/11_13:45:14 info: Retrying failed stop operation [drbddisk::Filesystem::/dev/drbd0::/data::ext3]
ResourceManager[9894]: 2008/12/11_13:45:14 info: Running /etc/ha.d/resource.d/drbddisk Filesystem /dev/drbd0 /data ext3 stop
ResourceManager[9894]: 2008/12/11_13:45:14 ERROR: Return code 1 from /etc/ha.d/resource.d/drbddisk
ResourceManager[9894]: 2008/12/11_13:45:15 info: Retrying failed stop operation [drbddisk::Filesystem::/dev/drbd0::/data::ext3]
ResourceManager[9894]: 2008/12/11_13:45:15 info: Running /etc/ha.d/resource.d/drbddisk Filesystem /dev/drbd0 /data ext3 stop
ResourceManager[9894]: 2008/12/11_13:45:15 ERROR: Return code 1 from /etc/ha.d/resource.d/drbddisk
ResourceManager[9894]: 2008/12/11_13:45:15 ERROR: Resource script for drbddisk::Filesystem::/dev/drbd0::/data::ext3 probably not LSB-compliant.
ResourceManager[9894]: 2008/12/11_13:45:15 WARN: it (drbddisk::Filesystem::/dev/drbd0::/data::ext3) MUST succeed on a stop when already stopped
ResourceManager[9894]: 2008/12/11_13:45:15 WARN: Machine reboot narrowly avoided!
ResourceManager[9894]: 2008/12/11_13:45:15 info: Running /etc/ha.d/resource.d/IPaddr 192.168.15.24/24/eth0/192.168.15.255 stop
IPaddr[10704]: 2008/12/11_13:45:15 INFO: /sbin/route -n del -host 192.168.15.24
IPaddr[10704]: 2008/12/11_13:45:15 INFO: /sbin/ifconfig eth0:0 192.168.15.24 down
---this is where pings stop again---
IPaddr[10704]: 2008/12/11_13:45:15 INFO: IP Address 192.168.15.24 released
IPaddr[10622]: 2008/12/11_13:45:15 INFO: IPaddr Success
heartbeat[6755]: 2008/12/11_13:45:36 WARN: node drbd2: is dead
heartbeat[6755]: 2008/12/11_13:45:36 info: Dead node drbd2 gave up resources.
heartbeat[6755]: 2008/12/11_13:45:36 info: Link drbd2:eth0 dead.
hb_standby[10745]: 2008/12/11_13:45:45 Going standby [foreign].
heartbeat[6755]: 2008/12/11_13:45:45 info: drbd1 wants to go standby [foreign]
heartbeat[6755]: 2008/12/11_13:45:57 WARN: No reply to standby request. Standby request cancelled.

_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems
