Re: [Linux-HA] Secondary reboot

Dejan Muhamedagic Tue, 26 Feb 2008 02:28:31 -0800

Hi,

On Tue, Feb 26, 2008 at 04:35:06PM +1300, Luke Pascoe wrote:
> Hello
>
> I'm trying to do NFS failover in a test environment with an underlying 
> OCFS2 filesystem. This is something that's apparently been done before and 
> certainly HA NFS isn't new.
>
> I've followed several HowTos, all of which seem to suggest pretty much the 
> same setup, but I seem to get the same problem no matter how I configure 
> it.
>
> Here's the setup:
>
> 2 VMWare VMs (NODE01 and NODE02) running RHEL4 U5 x86_64 with a shared 
> fibre channel SAN volume. That volume is OCFS2 formatted and mounted as 
> /data on both hosts.
>
> Each host has 2 interfaces. An external facing 10.0.0.0/24 and an internal 
> 192.168.100.0/30
>
> Heartbeat is installed on both nodes and configured identically as follows:
>
> =ha.cf=
> keepalive 2
> deadtime 30
> warntime 10
> initdead 120
> bcast   eth0
> ucast eth1 192.168.100.1 # Obviously on NODE02 this is 192.168.100.2
> auto_failback on
> node NODE02 NODE01
>  ping 10.0.0.133 # this is an unrelated LAN host
> respawn hacluster /usr/lib64/heartbeat/ipfail
>    use_logd yes
>  crm off
> =/ha.cf=
>
> =haresources=
> NODE01 10.199.133.90 nfslock nfs_wrapper
> =/haresources=
>
> BTW, crm is off because I tried it with it on and got EXACTLY the same 
> result.
>
> Here's the problem:
>
> Both hosts start up just fine, NODE01 picks up all 3 resources and 
> everything's roses. If I induce a failure on NODE01, NODE02 correctly 
> acquires the resources and everything is still roses. HOWEVER, ~30 seconds 
> later NODE02 reboots. Now the odd thing is, it doesn't matter which is the 
> primary node, or which host has the failure or even which has the 
> resources, NODE02 always reboots when there's a failure.


How do you induce a failover?

> Even if the resources are started on NODE02 and NODE01 has a failure (ie, 
> everything should stay as it is, no failover required) 30 seconds after the 
> failure NODE02 reboots!!!
>
> I've got NTP syncing the time, so it's not a clock issue, and I've tried 
> twiddling just about every setting in the config, to no avail.
>
> Any help? Please?

Anything in the logs? Perhaps post them.

Thanks,

Dejan

> -- 
> Regards
>
> Luke Pascoe
> Linux Systems Engineer
> Asterisk
>
> T 09 366 8835
> F 09 302 1772
> M 0274 266649
> E [EMAIL PROTECTED]
> W www.asterisk.co.nz
>
> Level 9, Gen-i Tower, 66 Wyndham Street
> PO Box 8804, Auckland, New Zealand
> _______________________________________________
> Linux-HA mailing list
> [email protected]
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Re: [Linux-HA] Secondary reboot

Reply via email to