Yeah, you hit it right on the head. Saw it about 10 mins ago, changed it to /24 (was thinking
255.255.255.0) and it's behaving itself.
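
For reference, the /24 to 255.255.255.0 correspondence can be checked with a few lines of bash. This is only a sketch: prefix_to_netmask is a made-up helper name, not part of Heartbeat or findif, and it assumes a prefix length between 0 and 32.

```shell
#!/bin/bash
# Hypothetical helper (not part of Heartbeat): convert a CIDR prefix length
# (0-32) into a dotted-quad netmask.
prefix_to_netmask() {
    local prefix=$1 mask="" i bits
    for i in 1 2 3 4; do
        # Work out how many of this octet's 8 bits are set.
        if [ "$prefix" -ge 8 ]; then
            bits=8
        elif [ "$prefix" -gt 0 ]; then
            bits=$prefix
        else
            bits=0
        fi
        # An octet with its top "bits" bits set equals 256 - 2^(8 - bits).
        mask="${mask}${mask:+.}$(( 256 - (1 << (8 - bits)) ))"
        prefix=$(( prefix - bits ))
    done
    echo "$mask"
}

prefix_to_netmask 24
```

A prefix of 0 comes out as 0.0.0.0, which is the netmask findif rejected in the log quoted below.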

I need it to restart a script in /etc/init.d. It almost looks like I can just create a new
script:  /etc/ha.d/resource.d/myscripts

In that script just have:

#!/bin/bash
/etc/init.d/myscripts restart


I don't need "start/stop" functionality at this point, but it might be useful. The other scripts in resource.d don't make much sense to me yet (keeping in mind I've only been looking at this for about 2 days). Is there a good link on how to make a start/stop/restart resource that calls an init script?
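
A minimal sketch of what such a wrapper might look like, assuming the init script at /etc/init.d/myscripts accepts start, stop, restart and status. The INIT_SCRIPT variable and the run_action function are illustrative names only, not Heartbeat settings. The log quoted below shows Heartbeat passing the action as the last argument ("IPaddr 10.1.1.31/0/eth1 start"), so the sketch dispatches on the last argument.

```shell
#!/bin/bash
# Sketch of /etc/ha.d/resource.d/myscripts (path taken from the message above).
# INIT_SCRIPT is a hypothetical override for testing, not a Heartbeat setting.
INIT_SCRIPT="${INIT_SCRIPT:-/etc/init.d/myscripts}"

# Forward one action to the init script and return its exit code.
run_action() {
    case "$1" in
        start|stop|restart|status)
            "$INIT_SCRIPT" "$1"
            ;;
        *)
            echo "Usage: myscripts {start|stop|restart|status}" >&2
            return 1
            ;;
    esac
}

# Dispatch on the last argument, since any resource parameters come first.
# With no arguments the script does nothing.
if [ "$#" -gt 0 ]; then
    run_action "${@: -1}"
    exit $?
fi
```

Passing the init script's exit code straight through matters, because the log shows ResourceManager giving up the resource group when a resource script returns non-zero. It may also be worth checking whether your Heartbeat version looks up haresources entries in /etc/init.d directly, in which case no wrapper would be needed.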

Robert


On 12/16/08 1:05 PM, Dejan Muhamedagic wrote:
Hi,

On Tue, Dec 16, 2008 at 11:49:48AM -0700, Robert L. Harris wrote:
   I'm setting up a 2 node cluster.  I have the 2 machines online fine
running Ubuntu Gutsy.  My configs are very simple at this point:

haresources:
   #Testing
   sherman IPaddr::192.168.12.1/0/eth0
   sherman IPaddr::10.1.1.31/0/eth1

Bad netmasks.

Thanks,

Dejan

ha.cf:
:/etc/ha.d# cat ha.cf  | egrep -v '^$|^#'
   debugfile /var/log/ha-debug
   logfile /var/log/ha-log
   logfacility     local0
   deadtime 15
   initdead 60
   bcast eth1      # Linux
   auto_failback on
   node  sherman
   node  redcloud
   debug 1

"uname -n" on node1 is "sherman" and node2 is "redcloud".  In the hostfile
on
both machines I have this:
   127.0.0.1 localhost
   10.1.1.250 sherman.coronasolutions.com  sherman
   10.1.1.251 redcloud.coronasolutions.com redcloud

authkeys:
   auth 1
   1 sha1 testingcluster


When I start heartbeat I see this in my logs.  I'm a bit confused, as it
claims success, but "ip addr" shows only my main IPs and some aliases I set
up; the 2 in haresources are not there:

heartbeat[24892]: 2008/12/16_11:44:16 debug: RscMgmtProc 'ip-request-resp'
exited code 0
heartbeat[24892]: 2008/12/16_11:44:16 info: AnnounceTakeover(local 1,
foreign 1, reason 'ip-request-resp' (1))
heartbeat[24892]: 2008/12/16_11:44:16 debug: StartNextRemoteRscReq() -
calling hook
heartbeat[24892]: 2008/12/16_11:44:16 debug: notify_world: invoking harc:
OLD status: active
heartbeat[24892]: 2008/12/16_11:44:16 debug: Process [ip-request-resp]
started pid 25415
heartbeat[24892]: 2008/12/16_11:44:16 debug: Starting notify process
[ip-request-resp]
heartbeat[25415]: 2008/12/16_11:44:16 debug: notify_world: setting SIGCHLD
Handler to SIG_DFL
heartbeat[25415]: 2008/12/16_11:44:16 debug: notify_world: Running harc
ip-request-resp
harc[25415]:    2008/12/16_11:44:16 info: Running
/etc/ha.d/rc.d/ip-request-resp ip-request-resp
ip-request-resp[25415]: 2008/12/16_11:44:16 received ip-request-resp
IPaddr::10.1.1.31/0/eth1 OK yes
ResourceManager[25434]: 2008/12/16_11:44:16 info: Acquiring resource group:
sherman IPaddr::10.1.1.31/0/eth1
IPaddr[25460]:  2008/12/16_11:44:16 INFO:  Resource is stopped
ResourceManager[25434]: 2008/12/16_11:44:16 info: Running
/etc/ha.d/resource.d/IPaddr 10.1.1.31/0/eth1 start
ResourceManager[25434]: 2008/12/16_11:44:16 debug: Starting
/etc/ha.d/resource.d/IPaddr 10.1.1.31/0/eth1 start
Invalid netmask specification [0]
/usr/lib/heartbeat/findif version 2.1.2 Copyright Alan Robertson

Usage: /usr/lib/heartbeat/findif [-C]
Options:
     -C: Output netmask as the number of bits rather than as 4 octets.
Environment variables:
OCF_RESKEY_ip      ip address (mandatory!)
OCF_RESKEY_cidr_netmask netmask of interface
OCF_RESKEY_broadcast    broadcast address for interface
OCF_RESKEY_nic     interface to assign to
IPaddr[25553]:  2008/12/16_11:44:16 ERROR: /usr/lib/heartbeat/findif failed
[rc=1].
IPaddr[25526]:  2008/12/16_11:44:16 ERROR:  Generic error
ERROR:  Generic error
ResourceManager[25434]: 2008/12/16_11:44:16 debug:
/etc/ha.d/resource.d/IPaddr 10.1.1.31/0/eth1 start done. RC=1
ResourceManager[25434]: 2008/12/16_11:44:16 ERROR: Return code 1 from
/etc/ha.d/resource.d/IPaddr
ResourceManager[25434]: 2008/12/16_11:44:16 CRIT: Giving up resources due
to failure of IPaddr::10.1.1.31/0/eth1
ResourceManager[25434]: 2008/12/16_11:44:16 info: Releasing resource group:
sherman IPaddr::10.1.1.31/0/eth1
ResourceManager[25434]: 2008/12/16_11:44:16 info: Running
/etc/ha.d/resource.d/IPaddr 10.1.1.31/0/eth1 stop
ResourceManager[25434]: 2008/12/16_11:44:16 debug: Starting
/etc/ha.d/resource.d/IPaddr 10.1.1.31/0/eth1 stop
In IP Stop
IPaddr[25629]:  2008/12/16_11:44:17 INFO:  Success
INFO:  Success
ResourceManager[25434]: 2008/12/16_11:44:17 debug:
/etc/ha.d/resource.d/IPaddr 10.1.1.31/0/eth1 stop done. RC=0
heartbeat[24892]: 2008/12/16_11:44:17 info: Exiting ip-request-resp process
25415 returned rc 0.
heartbeat[24892]: 2008/12/16_11:44:17 debug: RscMgmtProc 'ip-request-resp'
exited code 0
heartbeat[24892]: 2008/12/16_11:44:17 info: AnnounceTakeover(local 1,
foreign 1, reason 'ip-request-resp' (1))
heartbeat[24892]: 2008/12/16_11:44:26 info: Local Resource acquisition
completed. (none)
heartbeat[24892]: 2008/12/16_11:44:26 info: local resource transition
completed.
heartbeat[24892]: 2008/12/16_11:44:26 debug: Sending hold resources msg:
all, stable=1 #<none>
heartbeat[24892]: 2008/12/16_11:44:26 info: AnnounceTakeover(local 1,
foreign 1, reason 'T_RESOURCES(us)' (1))
heartbeat[24892]: 2008/12/16_11:44:26 debug: hb_rsc_isstable:
ResourceMgmt_child_count: 0, other_is_stable: 1, takeover_in_progress: 0,
going_standby: 0, standby running(ms): 0, resourcestate: 4
heartbeat[24892]: 2008/12/16_11:44:26 debug: hb_rsc_isstable:
ResourceMgmt_child_count: 0, other_is_stable: 1, takeover_in_progress: 0,
going_standby: 0, standby running(ms): 0, resourcestate: 4
hb_standby[25413]:     2008/12/16_11:44:46 Going standby [foreign].
heartbeat[24892]: 2008/12/16_11:44:46 debug: Received standby message me
from sherman in state 0
heartbeat[24892]: 2008/12/16_11:44:46 debug: ask_for_resources: other now
unstable
heartbeat[24892]: 2008/12/16_11:44:46 info: sherman wants to go standby
[foreign]
heartbeat[24892]: 2008/12/16_11:44:46 info: i_hold_resources: 3
heartbeat[24892]: 2008/12/16_11:44:46 info: New standby state: 1
hb_standby[25677]:     2008/12/16_11:44:47 Going standby [foreign].
heartbeat[24892]: 2008/12/16_11:44:47 debug: Received standby message me
from sherman in state 1
heartbeat[24892]: 2008/12/16_11:44:47 WARN: Standby in progress- new
request from sherman ignored [10 seconds left]
heartbeat[24892]: 2008/12/16_11:44:57 WARN: No reply to standby request.
Standby request cancelled.

--

:wq!
====================================================================
Robert L. Harris                     | GPG Key ID: E344DA3B
                                          @ x-hkp://pgp.mit.edu
DISCLAIMER:
       These are MY OPINIONS             With Dreams To Be A King,
        ALONE.  I speak for              First One Should Be A Man
        no-one else.                       - Manowar


_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

