>> --drbd1 configs--
>> /etc/heartbeat/haresources:
>> drbd1  IPaddr::192.168.15.24/24/eth0/192.168.15.255
>> drbddisk::Filesystem::/dev/drbd0::/data::ext3
>                                                 ^^^^
>
> Hmm,                                      you need say hb, what to do
>
> try something like:
>
> drbd1  IPaddr::192.168.15.24/24/eth0 drbddisk::drbddisk
> Filesystem::/dev/drbd0::/data::ext3 nfs-kernel-server

Thanks Thomas

No dice still.  I've got more logs and more info though.

I start a ping from the NFS client to the shared heartbeat address.
Then on node 2 of the heartbeat cluster I issue '/etc/init.d/heartbeat
stop' while tail-ing the logs on node 1 I get the following.

--watch for inline comments--

heartbeat[6755]: 2008/12/11_13:43:39 WARN: No reply to standby
request.  Standby request cancelled.
heartbeat[6755]: 2008/12/11_13:44:39 info: Heartbeat restart on node drbd2
heartbeat[6755]: 2008/12/11_13:44:39 info: Link drbd2:eth0 up.
heartbeat[6755]: 2008/12/11_13:44:39 info: Status update for node
drbd2: status init
heartbeat[6755]: 2008/12/11_13:44:39 info: Status update for node
drbd2: status up
harc[9696]:     2008/12/11_13:44:39 info: Running /etc/ha.d/rc.d/status status
harc[9705]:     2008/12/11_13:44:39 info: Running /etc/ha.d/rc.d/status status
heartbeat[6755]: 2008/12/11_13:44:40 info: Status update for node
drbd2: status active
harc[9714]:     2008/12/11_13:44:40 info: Running /etc/ha.d/rc.d/status status
heartbeat[6755]: 2008/12/11_13:44:40 info: remote resource transition completed.
heartbeat[6755]: 2008/12/11_13:45:04 info: Received shutdown notice
from 'drbd2'.
heartbeat[6755]: 2008/12/11_13:45:04 info: Resources being acquired from drbd2.
harc[9724]:     2008/12/11_13:45:04 info: Running /etc/ha.d/rc.d/status status
mach_down[9743]:        2008/12/11_13:45:04 info:
/usr/lib/heartbeat/mach_down: nice_failback: foreign resources
acquired
mach_down[9743]:        2008/12/11_13:45:04 info: mach_down takeover complete
for node drbd2.
heartbeat[6755]: 2008/12/11_13:45:04 info: mach_down takeover complete.
IPaddr[9779]:   2008/12/11_13:45:04 INFO: IPaddr Resource is stopped
heartbeat[9725]: 2008/12/11_13:45:04 info: Local Resource acquisition completed.
harc[9881]:     2008/12/11_13:45:04 info: Running
/etc/ha.d/rc.d/ip-request-resp ip-request-resp
ip-request-resp[9881]:  2008/12/11_13:45:04 received ip-request-resp
IPaddr::192.168.15.24/24/eth0/192.168.15.255 OK yes
ResourceManager[9894]:  2008/12/11_13:45:04 info: Acquiring resource
group: drbd1 IPaddr::192.168.15.24/24/eth0/192.168.15.255
drbddisk::Filesystem::/dev/drbd0::/data::ext3
IPaddr[9917]:   2008/12/11_13:45:05 INFO: IPaddr Resource is stopped
ResourceManager[9894]:  2008/12/11_13:45:05 info: Running
/etc/ha.d/resource.d/IPaddr 192.168.15.24/24/eth0/192.168.15.255 start

---this is where pings start responding to the nfs client and I can
mount the nfs share--

IPaddr[10112]:  2008/12/11_13:45:05 INFO: eval /sbin/ifconfig eth0:0
192.168.15.24 netmask 255.255.255.0 broadcast 192.168.15.255
IPaddr[10112]:  2008/12/11_13:45:05 INFO: Sending Gratuitous Arp for
192.168.15.24 on eth0:0 [eth0]
IPaddr[10112]:  2008/12/11_13:45:05 INFO: /usr/lib/heartbeat/send_arp
-i 500 -r 10 -p
/var/run/heartbeat/rsctmp/send_arp/send_arp-192.168.15.24 eth0
192.168.15.24 auto 192.168.15.24 ffffffffffff
IPaddr[10030]:  2008/12/11_13:45:05 INFO: IPaddr Success
ResourceManager[9894]:  2008/12/11_13:45:05 info: Running
/etc/ha.d/resource.d/drbddisk Filesystem /dev/drbd0 /data ext3 start
ResourceManager[9894]:  2008/12/11_13:45:05 ERROR: Return code 1 from
/etc/ha.d/resource.d/drbddisk
ResourceManager[9894]:  2008/12/11_13:45:05 CRIT: Giving up resources
due to failure of drbddisk::Filesystem::/dev/drbd0::/data::ext3
ResourceManager[9894]:  2008/12/11_13:45:05 info: Releasing resource
group: drbd1 IPaddr::192.168.15.24/24/eth0/192.168.15.255
drbddisk::Filesystem::/dev/drbd0::/data::ext3
ResourceManager[9894]:  2008/12/11_13:45:05 info: Running
/etc/ha.d/resource.d/drbddisk Filesystem /dev/drbd0 /data ext3 stop
ResourceManager[9894]:  2008/12/11_13:45:05 ERROR: Return code 1 from
/etc/ha.d/resource.d/drbddisk
ResourceManager[9894]:  2008/12/11_13:45:06 info: Retrying failed stop
operation [drbddisk::Filesystem::/dev/drbd0::/data::ext3]
ResourceManager[9894]:  2008/12/11_13:45:06 info: Running
/etc/ha.d/resource.d/drbddisk Filesystem /dev/drbd0 /data ext3 stop
ResourceManager[9894]:  2008/12/11_13:45:06 ERROR: Return code 1 from
/etc/ha.d/resource.d/drbddisk
ResourceManager[9894]:  2008/12/11_13:45:07 info: Retrying failed stop
operation [drbddisk::Filesystem::/dev/drbd0::/data::ext3]
ResourceManager[9894]:  2008/12/11_13:45:07 info: Running
/etc/ha.d/resource.d/drbddisk Filesystem /dev/drbd0 /data ext3 stop
ResourceManager[9894]:  2008/12/11_13:45:07 ERROR: Return code 1 from
/etc/ha.d/resource.d/drbddisk
ResourceManager[9894]:  2008/12/11_13:45:08 info: Retrying failed stop
operation [drbddisk::Filesystem::/dev/drbd0::/data::ext3]
ResourceManager[9894]:  2008/12/11_13:45:08 info: Running
/etc/ha.d/resource.d/drbddisk Filesystem /dev/drbd0 /data ext3 stop
ResourceManager[9894]:  2008/12/11_13:45:08 ERROR: Return code 1 from
/etc/ha.d/resource.d/drbddisk
ResourceManager[9894]:  2008/12/11_13:45:09 info: Retrying failed stop
operation [drbddisk::Filesystem::/dev/drbd0::/data::ext3]
ResourceManager[9894]:  2008/12/11_13:45:09 info: Running
/etc/ha.d/resource.d/drbddisk Filesystem /dev/drbd0 /data ext3 stop
ResourceManager[9894]:  2008/12/11_13:45:09 ERROR: Return code 1 from
/etc/ha.d/resource.d/drbddisk
ResourceManager[9894]:  2008/12/11_13:45:10 info: Retrying failed stop
operation [drbddisk::Filesystem::/dev/drbd0::/data::ext3]
ResourceManager[9894]:  2008/12/11_13:45:10 info: Running
/etc/ha.d/resource.d/drbddisk Filesystem /dev/drbd0 /data ext3 stop
ResourceManager[9894]:  2008/12/11_13:45:10 ERROR: Return code 1 from
/etc/ha.d/resource.d/drbddisk
ResourceManager[9894]:  2008/12/11_13:45:11 info: Retrying failed stop
operation [drbddisk::Filesystem::/dev/drbd0::/data::ext3]
ResourceManager[9894]:  2008/12/11_13:45:11 info: Running
/etc/ha.d/resource.d/drbddisk Filesystem /dev/drbd0 /data ext3 stop
ResourceManager[9894]:  2008/12/11_13:45:11 ERROR: Return code 1 from
/etc/ha.d/resource.d/drbddisk
ResourceManager[9894]:  2008/12/11_13:45:12 info: Retrying failed stop
operation [drbddisk::Filesystem::/dev/drbd0::/data::ext3]
ResourceManager[9894]:  2008/12/11_13:45:12 info: Running
/etc/ha.d/resource.d/drbddisk Filesystem /dev/drbd0 /data ext3 stop
ResourceManager[9894]:  2008/12/11_13:45:12 ERROR: Return code 1 from
/etc/ha.d/resource.d/drbddisk
ResourceManager[9894]:  2008/12/11_13:45:13 info: Retrying failed stop
operation [drbddisk::Filesystem::/dev/drbd0::/data::ext3]
ResourceManager[9894]:  2008/12/11_13:45:13 info: Running
/etc/ha.d/resource.d/drbddisk Filesystem /dev/drbd0 /data ext3 stop
ResourceManager[9894]:  2008/12/11_13:45:13 ERROR: Return code 1 from
/etc/ha.d/resource.d/drbddisk
ResourceManager[9894]:  2008/12/11_13:45:14 info: Retrying failed stop
operation [drbddisk::Filesystem::/dev/drbd0::/data::ext3]
ResourceManager[9894]:  2008/12/11_13:45:14 info: Running
/etc/ha.d/resource.d/drbddisk Filesystem /dev/drbd0 /data ext3 stop
ResourceManager[9894]:  2008/12/11_13:45:14 ERROR: Return code 1 from
/etc/ha.d/resource.d/drbddisk
ResourceManager[9894]:  2008/12/11_13:45:15 info: Retrying failed stop
operation [drbddisk::Filesystem::/dev/drbd0::/data::ext3]
ResourceManager[9894]:  2008/12/11_13:45:15 info: Running
/etc/ha.d/resource.d/drbddisk Filesystem /dev/drbd0 /data ext3 stop
ResourceManager[9894]:  2008/12/11_13:45:15 ERROR: Return code 1 from
/etc/ha.d/resource.d/drbddisk
ResourceManager[9894]:  2008/12/11_13:45:15 ERROR: Resource script for
drbddisk::Filesystem::/dev/drbd0::/data::ext3 probably not
LSB-compliant.
ResourceManager[9894]:  2008/12/11_13:45:15 WARN: it
(drbddisk::Filesystem::/dev/drbd0::/data::ext3) MUST succeed on a stop
when already stopped
ResourceManager[9894]:  2008/12/11_13:45:15 WARN: Machine reboot
narrowly avoided!
ResourceManager[9894]:  2008/12/11_13:45:15 info: Running
/etc/ha.d/resource.d/IPaddr 192.168.15.24/24/eth0/192.168.15.255 stop
IPaddr[10704]:  2008/12/11_13:45:15 INFO: /sbin/route -n del -host 192.168.15.24
IPaddr[10704]:  2008/12/11_13:45:15 INFO: /sbin/ifconfig eth0:0
192.168.15.24 down

---this is where pings stop again---

IPaddr[10704]:  2008/12/11_13:45:15 INFO: IP Address 192.168.15.24 released
IPaddr[10622]:  2008/12/11_13:45:15 INFO: IPaddr Success
heartbeat[6755]: 2008/12/11_13:45:36 WARN: node drbd2: is dead
heartbeat[6755]: 2008/12/11_13:45:36 info: Dead node drbd2 gave up resources.
heartbeat[6755]: 2008/12/11_13:45:36 info: Link drbd2:eth0 dead.
hb_standby[10745]:      2008/12/11_13:45:45 Going standby [foreign].
heartbeat[6755]: 2008/12/11_13:45:45 info: drbd1 wants to go standby [foreign]
heartbeat[6755]: 2008/12/11_13:45:57 WARN: No reply to standby
request.  Standby request cancelled.
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Reply via email to