Dejan Muhamedagic wrote:
> On Sun, Jun 17, 2007 at 12:32:20AM +0100, Matthew Baker wrote:
>> Hello,
>>
>> I'm having a problem with ipfail failing to start and I can't seem to
>> find an answer by searching the net/forums.
>>
>> Syslog displays the first 2 messages several times before giving up with
>> the last message.
>>
>> heartbeat: [32428]: info: Starting
>> "/usr/local/packages/heartbeat/lib/heartbeat/ipfail" as uid 17  gid 65
>> (pid 32428)
>> heartbeat: [31636]: WARN: Exiting
>> /usr/local/packages/heartbeat/lib/heartbeat/ipfail process 32428
>> returned rc 1.
>> heartbeat: [31636]: ERROR: Client
>> /usr/local/packages/heartbeat/lib/heartbeat/ipfail "respawning too fast"
>>
>> If I run that command as hacluster I get:
>>
>> ipfail[3508]: 2007/06/16_23:09:17 debug: PID=3508
>> ipfail[3508]: 2007/06/16_23:09:17 debug: Signing in with heartbeat
>> ipfail[3508]: 2007/06/16_23:09:17 ERROR: Cannot sign on with heartbeat
>> ipfail[3508]: 2007/06/16_23:09:17 ERROR: REASON: hb_api_signon: Can't
>> initiate connection  to heartbeat
>>
>> Same error is listed for ccm & cibmon when I run BasicSanityCheck.
>>
>> Config
>>
>> logfacility local5
>> node node1 node2
>> keepalive 1
>> deadtime 31
>> warntime 20
>> ucast eth1 192.168.10.10
>> ucast eth1 192.168.10.11
>> ping 192.168.10.1
>> auto_failback no
>> respawn hacluster /usr/local/packages/heartbeat/lib/heartbeat/ipfail
>>
>> User and permissions
>>
>> [EMAIL PROTECTED] root]# egrep '(17|65)'  /etc/{passwd,group}
>> /etc/passwd:nfsnobody:x:65534:65534:Anonymous NFS
>> User:/var/lib/nfs:/sbin/nologin
>> /etc/passwd:hacluster:x:17:65:High Availability
>> User:/home/hacluster:/bin/bash
>> /etc/group:nfsnobody:x:65534:
>> /etc/group:haclient:x:65:
>>
>> I'm pretty sure this has been OK in the past (pre version 2.0.7 which is
>> the current version in use). The heartbeat package is installed from
>> source and is running on RHEL3.
>>
>> Previous list posts suggest this is a permission problem. I've changed
>> the perms on /var/run/heartbeat, the binary for ipfail and even the fifo
>> in /var/lib/heartbeat, but I just feel I'm stabbing in the dark.
>>
>> Any suggestions would be greatly welcomed.
>
> Strange. It must be a permission problem. Could you turn debugging
> on and post results? If you start ipfail as root does that help?
>
>> Cheers,
>>
>> Matt
>>
>> _______________________________________________
>> Linux-HA mailing list
>> [email protected]
>> http://lists.linux-ha.org/mailman/listinfo/linux-ha
>> See also: http://linux-ha.org/ReportingProblems
>

Hi, thanks for the response. Sorry for not replying sooner.

Interestingly I have exactly the same problem on both servers. I've
also upgraded to version 2.0.8 and set up channel bonding since the last
email.


Running ipfail as root (with tail -f /var/log/ha.log (syslog) running alongside):

[EMAIL PROTECTED] root]# /usr/local/packages/heartbeat/lib/heartbeat/ipfail
ipfail[19996]: 2007/06/21_16:42:13 debug: PID=19996
ipfail[19996]: 2007/06/21_16:42:13 debug: Signing in with heartbeat
ipfail[19996]: 2007/06/21_16:42:13 ERROR: No managed resources
[EMAIL PROTECTED] root]# Jun 21 16:42:13 node1 heartbeat: [6373]: WARN: Client
[ipfail] pid 19996 failed authorization [client failed authorization]
Jun 21 16:42:13 node1 heartbeat: [6373]: ERROR:
api_process_registration_msg: cannot add client(ipfail)
Jun 21 16:42:13 node1 ipfail: [19996]: ERROR: No managed resources

This message is the same on both nodes.
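
To compare the two nodes without the PID and timestamp noise, here is a
minimal sketch that pulls only the WARN and ERROR entries out of output like
the above. The regular expression and the name extract_problems are
assumptions inferred from the lines shown in this mail, not from any
documented heartbeat log format, and wrapped continuation lines are not
rejoined.

```python
import re

# Assumed line shapes, inferred only from the output quoted in this mail:
#   syslog:  "<date> <host> <prog>: [<pid>]: <LEVEL>: <message>"
#   stderr:  "<prog>[<pid>]: <timestamp> <LEVEL>: <message>"
LINE_RE = re.compile(
    r"(?P<prog>\w+)(?:: )?\[(?P<pid>\d+)\]: (?:\S+ )?"
    r"(?P<level>WARN|ERROR): ?(?P<msg>.*)"
)


def extract_problems(text):
    """Return (program, level, message) tuples for WARN/ERROR lines."""
    problems = []
    for line in text.splitlines():
        match = LINE_RE.search(line)
        if match:
            problems.append(
                (match.group("prog"), match.group("level"),
                 match.group("msg").strip())
            )
    return problems
```

Running extract_problems over the captured output of each node and comparing
the two results as sets would show whether the nodes really report the same
warnings and errors.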

Debug log can be found at http://pastebin.com/933452

TIA,

Matt


-- 

    Matthew Baker, UNIX Systems Administrator
----------------------------------------------------
Institute for Learning and Research Technology (ILRT)
A: University of Bristol,
   8-10 Berkeley Square,
   Bristol.
   BS8 1HH
W: http://www.ilrt.bristol.ac.uk
E: [EMAIL PROTECTED]
T: +44 (0)117 928 7121
