I'm setting up a 2 node cluster. I have the 2 machines online fine running Ubuntu
Gutsy.  My configs are very simple at this point:

haresources:
  #Testing
  sherman IPaddr::192.168.12.1/0/eth0
  sherman IPaddr::10.1.1.31/0/eth1


ha.cf:
:/etc/ha.d# cat ha.cf  | egrep -v '^$|^#'
  debugfile /var/log/ha-debug
  logfile /var/log/ha-log
  logfacility     local0
  deadtime 15
  initdead 60
  bcast eth1      # Linux
  auto_failback on
  node  sherman
  node  redcloud
  debug 1

"uname -n" on node1 is "sherman" and node2 is "redcloud". In the hostfile on
both machines I have this:
  127.0.0.1 localhost
  10.1.1.250 sherman.coronasolutions.com  sherman
  10.1.1.251 redcloud.coronasolutions.com redcloud

authkeys:
  auth 1
  1 sha1 testingcluster


When I start heartbeat I see this in my logs. I'm a bit confused as it claims success but "ip addr" shows my main IP's and some aliases I set up but the 2 in haresources are not there:

heartbeat[24892]: 2008/12/16_11:44:16 debug: RscMgmtProc 'ip-request-resp' exited code 0 heartbeat[24892]: 2008/12/16_11:44:16 info: AnnounceTakeover(local 1, foreign 1, reason 'ip-request-resp' (1)) heartbeat[24892]: 2008/12/16_11:44:16 debug: StartNextRemoteRscReq() - calling hook heartbeat[24892]: 2008/12/16_11:44:16 debug: notify_world: invoking harc: OLD status: active heartbeat[24892]: 2008/12/16_11:44:16 debug: Process [ip-request-resp] started pid 25415 heartbeat[24892]: 2008/12/16_11:44:16 debug: Starting notify process [ip-request-resp] heartbeat[25415]: 2008/12/16_11:44:16 debug: notify_world: setting SIGCHLD Handler to SIG_DFL heartbeat[25415]: 2008/12/16_11:44:16 debug: notify_world: Running harc ip-request-resp harc[25415]: 2008/12/16_11:44:16 info: Running /etc/ha.d/rc.d/ip-request-resp ip-request-resp ip-request-resp[25415]: 2008/12/16_11:44:16 received ip-request-resp IPaddr::10.1.1.31/0/eth1 OK yes ResourceManager[25434]: 2008/12/16_11:44:16 info: Acquiring resource group: sherman IPaddr::10.1.1.31/0/eth1
IPaddr[25460]:  2008/12/16_11:44:16 INFO:  Resource is stopped
ResourceManager[25434]: 2008/12/16_11:44:16 info: Running /etc/ha.d/resource.d/IPaddr 10.1.1.31/0/eth1 start ResourceManager[25434]: 2008/12/16_11:44:16 debug: Starting /etc/ha.d/resource.d/IPaddr 10.1.1.31/0/eth1 start
Invalid netmask specification [0]
/usr/lib/heartbeat/findif version 2.1.2 Copyright Alan Robertson

Usage: /usr/lib/heartbeat/findif [-C]
Options:
    -C: Output netmask as the number of bits rather than as 4 octets.
Environment variables:
OCF_RESKEY_ip      ip address (mandatory!)
OCF_RESKEY_cidr_netmask netmask of interface
OCF_RESKEY_broadcast    broadcast address for interface
OCF_RESKEY_nic     interface to assign to
IPaddr[25553]: 2008/12/16_11:44:16 ERROR: /usr/lib/heartbeat/findif failed [rc=1].
IPaddr[25526]:  2008/12/16_11:44:16 ERROR:  Generic error
ERROR:  Generic error
ResourceManager[25434]: 2008/12/16_11:44:16 debug: /etc/ha.d/resource.d/IPaddr 10.1.1.31/0/eth1 start done. RC=1 ResourceManager[25434]: 2008/12/16_11:44:16 ERROR: Return code 1 from /etc/ha.d/resource.d/IPaddr ResourceManager[25434]: 2008/12/16_11:44:16 CRIT: Giving up resources due to failure of IPaddr::10.1.1.31/0/eth1 ResourceManager[25434]: 2008/12/16_11:44:16 info: Releasing resource group: sherman IPaddr::10.1.1.31/0/eth1 ResourceManager[25434]: 2008/12/16_11:44:16 info: Running /etc/ha.d/resource.d/IPaddr 10.1.1.31/0/eth1 stop ResourceManager[25434]: 2008/12/16_11:44:16 debug: Starting /etc/ha.d/resource.d/IPaddr 10.1.1.31/0/eth1 stop
In IP Stop
IPaddr[25629]:  2008/12/16_11:44:17 INFO:  Success
INFO:  Success
ResourceManager[25434]: 2008/12/16_11:44:17 debug: /etc/ha.d/resource.d/IPaddr 10.1.1.31/0/eth1 stop done. RC=0 heartbeat[24892]: 2008/12/16_11:44:17 info: Exiting ip-request-resp process 25415 returned rc 0. heartbeat[24892]: 2008/12/16_11:44:17 debug: RscMgmtProc 'ip-request-resp' exited code 0 heartbeat[24892]: 2008/12/16_11:44:17 info: AnnounceTakeover(local 1, foreign 1, reason 'ip-request-resp' (1)) heartbeat[24892]: 2008/12/16_11:44:26 info: Local Resource acquisition completed. (none) heartbeat[24892]: 2008/12/16_11:44:26 info: local resource transition completed. heartbeat[24892]: 2008/12/16_11:44:26 debug: Sending hold resources msg: all, stable=1 # <none> heartbeat[24892]: 2008/12/16_11:44:26 info: AnnounceTakeover(local 1, foreign 1, reason 'T_RESOURCES(us)' (1)) heartbeat[24892]: 2008/12/16_11:44:26 debug: hb_rsc_isstable: ResourceMgmt_child_count: 0, other_is_stable: 1, takeover_in_progress: 0, going_standby: 0, standby running(ms): 0, resourcestate: 4 heartbeat[24892]: 2008/12/16_11:44:26 debug: hb_rsc_isstable: ResourceMgmt_child_count: 0, other_is_stable: 1, takeover_in_progress: 0, going_standby: 0, standby running(ms): 0, resourcestate: 4
hb_standby[25413]:     2008/12/16_11:44:46 Going standby [foreign].
heartbeat[24892]: 2008/12/16_11:44:46 debug: Received standby message me from sherman in state 0 heartbeat[24892]: 2008/12/16_11:44:46 debug: ask_for_resources: other now unstable heartbeat[24892]: 2008/12/16_11:44:46 info: sherman wants to go standby [foreign]
heartbeat[24892]: 2008/12/16_11:44:46 info: i_hold_resources: 3
heartbeat[24892]: 2008/12/16_11:44:46 info: New standby state: 1
hb_standby[25677]:     2008/12/16_11:44:47 Going standby [foreign].
heartbeat[24892]: 2008/12/16_11:44:47 debug: Received standby message me from sherman in state 1 heartbeat[24892]: 2008/12/16_11:44:47 WARN: Standby in progress- new request from sherman ignored [10 seconds left] heartbeat[24892]: 2008/12/16_11:44:57 WARN: No reply to standby request. Standby request cancelled.

--

:wq!
====================================================================
Robert L. Harris                     | GPG Key ID: E344DA3B
                                         @ x-hkp://pgp.mit.edu
DISCLAIMER:
      These are MY OPINIONS             With Dreams To Be A King,
       ALONE.  I speak for              First One Should Be A Man
       no-one else.                       - Manowar


_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Reply via email to