I'm setting up a 2 node cluster. I have the 2 machines online fine
running Ubuntu
Gutsy. My configs are very simple at this point:
haresources:
#Testing
sherman IPaddr::192.168.12.1/0/eth0
sherman IPaddr::10.1.1.31/0/eth1
ha.cf:
:/etc/ha.d# cat ha.cf | egrep -v '^$|^#'
debugfile /var/log/ha-debug
logfile /var/log/ha-log
logfacility local0
deadtime 15
initdead 60
bcast eth1 # Linux
auto_failback on
node sherman
node redcloud
debug 1
"uname -n" on node1 is "sherman" and node2 is "redcloud". In the
hostfile on
both machines I have this:
127.0.0.1 localhost
10.1.1.250 sherman.coronasolutions.com sherman
10.1.1.251 redcloud.coronasolutions.com redcloud
authkeys:
auth 1
1 sha1 testingcluster
When I start heartbeat I see this in my logs. I'm a bit confused as it
claims success but "ip addr" shows my main IP's and some aliases I set
up but the 2 in haresources are not there:
heartbeat[24892]: 2008/12/16_11:44:16 debug: RscMgmtProc
'ip-request-resp' exited code 0
heartbeat[24892]: 2008/12/16_11:44:16 info: AnnounceTakeover(local 1,
foreign 1, reason 'ip-request-resp' (1))
heartbeat[24892]: 2008/12/16_11:44:16 debug: StartNextRemoteRscReq() -
calling hook
heartbeat[24892]: 2008/12/16_11:44:16 debug: notify_world: invoking
harc: OLD status: active
heartbeat[24892]: 2008/12/16_11:44:16 debug: Process [ip-request-resp]
started pid 25415
heartbeat[24892]: 2008/12/16_11:44:16 debug: Starting notify process
[ip-request-resp]
heartbeat[25415]: 2008/12/16_11:44:16 debug: notify_world: setting
SIGCHLD Handler to SIG_DFL
heartbeat[25415]: 2008/12/16_11:44:16 debug: notify_world: Running harc
ip-request-resp
harc[25415]: 2008/12/16_11:44:16 info: Running
/etc/ha.d/rc.d/ip-request-resp ip-request-resp
ip-request-resp[25415]: 2008/12/16_11:44:16 received ip-request-resp
IPaddr::10.1.1.31/0/eth1 OK yes
ResourceManager[25434]: 2008/12/16_11:44:16 info: Acquiring resource
group: sherman IPaddr::10.1.1.31/0/eth1
IPaddr[25460]: 2008/12/16_11:44:16 INFO: Resource is stopped
ResourceManager[25434]: 2008/12/16_11:44:16 info: Running
/etc/ha.d/resource.d/IPaddr 10.1.1.31/0/eth1 start
ResourceManager[25434]: 2008/12/16_11:44:16 debug: Starting
/etc/ha.d/resource.d/IPaddr 10.1.1.31/0/eth1 start
Invalid netmask specification [0]
/usr/lib/heartbeat/findif version 2.1.2 Copyright Alan Robertson
Usage: /usr/lib/heartbeat/findif [-C]
Options:
-C: Output netmask as the number of bits rather than as 4 octets.
Environment variables:
OCF_RESKEY_ip ip address (mandatory!)
OCF_RESKEY_cidr_netmask netmask of interface
OCF_RESKEY_broadcast broadcast address for interface
OCF_RESKEY_nic interface to assign to
IPaddr[25553]: 2008/12/16_11:44:16 ERROR: /usr/lib/heartbeat/findif
failed [rc=1].
IPaddr[25526]: 2008/12/16_11:44:16 ERROR: Generic error
ERROR: Generic error
ResourceManager[25434]: 2008/12/16_11:44:16 debug:
/etc/ha.d/resource.d/IPaddr 10.1.1.31/0/eth1 start done. RC=1
ResourceManager[25434]: 2008/12/16_11:44:16 ERROR: Return code 1 from
/etc/ha.d/resource.d/IPaddr
ResourceManager[25434]: 2008/12/16_11:44:16 CRIT: Giving up resources
due to failure of IPaddr::10.1.1.31/0/eth1
ResourceManager[25434]: 2008/12/16_11:44:16 info: Releasing resource
group: sherman IPaddr::10.1.1.31/0/eth1
ResourceManager[25434]: 2008/12/16_11:44:16 info: Running
/etc/ha.d/resource.d/IPaddr 10.1.1.31/0/eth1 stop
ResourceManager[25434]: 2008/12/16_11:44:16 debug: Starting
/etc/ha.d/resource.d/IPaddr 10.1.1.31/0/eth1 stop
In IP Stop
IPaddr[25629]: 2008/12/16_11:44:17 INFO: Success
INFO: Success
ResourceManager[25434]: 2008/12/16_11:44:17 debug:
/etc/ha.d/resource.d/IPaddr 10.1.1.31/0/eth1 stop done. RC=0
heartbeat[24892]: 2008/12/16_11:44:17 info: Exiting ip-request-resp
process 25415 returned rc 0.
heartbeat[24892]: 2008/12/16_11:44:17 debug: RscMgmtProc
'ip-request-resp' exited code 0
heartbeat[24892]: 2008/12/16_11:44:17 info: AnnounceTakeover(local 1,
foreign 1, reason 'ip-request-resp' (1))
heartbeat[24892]: 2008/12/16_11:44:26 info: Local Resource acquisition
completed. (none)
heartbeat[24892]: 2008/12/16_11:44:26 info: local resource transition
completed.
heartbeat[24892]: 2008/12/16_11:44:26 debug: Sending hold resources msg:
all, stable=1 # <none>
heartbeat[24892]: 2008/12/16_11:44:26 info: AnnounceTakeover(local 1,
foreign 1, reason 'T_RESOURCES(us)' (1))
heartbeat[24892]: 2008/12/16_11:44:26 debug: hb_rsc_isstable:
ResourceMgmt_child_count: 0, other_is_stable: 1, takeover_in_progress:
0, going_standby: 0, standby running(ms): 0, resourcestate: 4
heartbeat[24892]: 2008/12/16_11:44:26 debug: hb_rsc_isstable:
ResourceMgmt_child_count: 0, other_is_stable: 1, takeover_in_progress:
0, going_standby: 0, standby running(ms): 0, resourcestate: 4
hb_standby[25413]: 2008/12/16_11:44:46 Going standby [foreign].
heartbeat[24892]: 2008/12/16_11:44:46 debug: Received standby message me
from sherman in state 0
heartbeat[24892]: 2008/12/16_11:44:46 debug: ask_for_resources: other
now unstable
heartbeat[24892]: 2008/12/16_11:44:46 info: sherman wants to go standby
[foreign]
heartbeat[24892]: 2008/12/16_11:44:46 info: i_hold_resources: 3
heartbeat[24892]: 2008/12/16_11:44:46 info: New standby state: 1
hb_standby[25677]: 2008/12/16_11:44:47 Going standby [foreign].
heartbeat[24892]: 2008/12/16_11:44:47 debug: Received standby message me
from sherman in state 1
heartbeat[24892]: 2008/12/16_11:44:47 WARN: Standby in progress- new
request from sherman ignored [10 seconds left]
heartbeat[24892]: 2008/12/16_11:44:57 WARN: No reply to standby
request. Standby request cancelled.
--
:wq!
====================================================================
Robert L. Harris | GPG Key ID: E344DA3B
@ x-hkp://pgp.mit.edu
DISCLAIMER:
These are MY OPINIONS With Dreams To Be A King,
ALONE. I speak for First One Should Be A Man
no-one else. - Manowar
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems