Re: [Linux-HA] what to do on loss of network

Steve Wray Wed, 23 Jan 2008 12:39:38 -0800

Well I posted my config and I've tried various things and tested thissetup... and it still behaves incorrectly: going primary in the event ofa complete loss of network connectivity.

I mean... its an NFS server... *network* filesystem. If it can't connectto the network *at* *all* it makes no sense to become the primary NFSserver...

I'd really appreciate some comment on what may be wrong in the configfiles that I've posted. If theres any further info that I need to postplease mention it.


Thanks!


Steve Wray wrote:

Dejan Muhamedagic wrote:
Hi,

On Wed, Jan 16, 2008 at 03:29:59PM +1300, Steve Wray wrote:
Hi there,

I have two servers set up with drbd, heartbeat and NFS.
They are only talking over the network, there is no serial heartbeat,and can't be at the moment.
What I'd like to happen is that if either one loses networkconnectivity they "sit down and shut up", ie the one which cannotreach, say, the router, does not try to become the active node; itsends drbd into standalone, does not mount the drbd filesystem, doesnot bring up NFS and does not bring up the floating interface. It haslost network, it should go into a 'passive' mode and wait for thenetwork to come back.
I tried using 'ping' with "respawn hacluster/usr/lib/heartbeat/ipfail" and to list a few IP addresses whichshould be reachable, but this doesn't seem to do what I want. Rather,when it loses network and cannot ping them it puts drbd intostandalone, mounts the drbd filesystem, starts NFS and brings up theinterface on the floating address... seems a bit wierd to me.
Is there an FAQ or something for this?
If you're using v1 style configuration, then ipfail is the right
thing. Otherwise, you should go with the pingd. See:
I'm using v1.
http://www.linux-ha.org/ipfail
I had read this, which is where I got the idea from.

However, it doesn't seem to be working.

Here is what I have in ha.cf:

logfacility     local0
keepalive 100ms
deadping 5
deadtime 30
warntime 10
ucast eth0 10.10.2.26
ucast eth0 10.10.2.27
node drbd-test-1
node drbd-test-2
auto_failback on
ping 10.10.10.1
respawn hacluster /usr/lib/heartbeat/ipfail



Here is what I have in haresources:

drbd-test-1  \
        drbddisk::drbdtest \
        Filesystem::/dev/drbd0::/data::ext3 \
        killnfsd \
        nfs-common \
        nfs-kernel-server \
        Delay::20::0 \
        IPaddr::10.10.2.28/16/eth0
If I go onto the node which is currently secondary and take down eth0 ittakes over as primary, mounts drbd0, starts nfs and tries to bring upthe floating IP.
This is the opposite of the behavior I expected to achieve with thisconfig.
Note that this is a /16 network ie 255.255.0.0 so all listed IPaddresses are in the same network space. 10.10.10.1 is the default routefor both nodes.
Both nodes are running on the same Xen dom0 host ie there is no physicalnetwork between them (I can't pull a network cable out to test loss ofconnectivity so I use ifdown) and are running Debian Etch.
http://linux-ha.org/pingd

Thanks,

Dejan
Thanks
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems


_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Re: [Linux-HA] what to do on loss of network

Reply via email to