On Sun, Jun 17, 2007 at 12:32:20AM +0100, Matthew Baker wrote:
> Hello,
>
> I'm having a problem with ipfail failing to start, and I can't seem to
> find an answer by searching the net or forums.
>
> Syslog shows the first two messages below several times before
> heartbeat gives up with the last message.
>
> heartbeat: [32428]: info: Starting "/usr/local/packages/heartbeat/lib/heartbeat/ipfail" as uid 17 gid 65 (pid 32428)
> heartbeat: [31636]: WARN: Exiting /usr/local/packages/heartbeat/lib/heartbeat/ipfail process 32428 returned rc 1.
> heartbeat: [31636]: ERROR: Client /usr/local/packages/heartbeat/lib/heartbeat/ipfail "respawning too fast"
>
> If I run that command as hacluster I get:
>
> ipfail[3508]: 2007/06/16_23:09:17 debug: PID=3508
> ipfail[3508]: 2007/06/16_23:09:17 debug: Signing in with heartbeat
> ipfail[3508]: 2007/06/16_23:09:17 ERROR: Cannot sign on with heartbeat
> ipfail[3508]: 2007/06/16_23:09:17 ERROR: REASON: hb_api_signon: Can't initiate connection to heartbeat
>
> Same error is listed for ccm & cibmon when I run BasicSanityCheck.
>
> Config
>
> logfacility local5
> node node1 node2
> keepalive 1
> deadtime 31
> warntime 20
> ucast eth1 192.168.10.10
> ucast eth1 192.168.10.11
> ping 192.168.10.1
> auto_failback no
> respawn hacluster /usr/local/packages/heartbeat/lib/heartbeat/ipfail
>
> User and permissions
>
> [EMAIL PROTECTED] root]# egrep '(17|65)' /etc/{passwd,group}
> /etc/passwd:nfsnobody:x:65534:65534:Anonymous NFS User:/var/lib/nfs:/sbin/nologin
> /etc/passwd:hacluster:x:17:65:High Availability User:/home/hacluster:/bin/bash
> /etc/group:nfsnobody:x:65534:
> /etc/group:haclient:x:65:
>
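The egrep pattern above also matches 65534, which is why nfsnobody
shows up. As a rough sketch only (the helper name entries_with_id is
made up for illustration, and the sample lines are simply copied from
your output), an exact match on the numeric field could look like this:

```python
def entries_with_id(lines, wanted_id, field):
    """Return the names from colon-separated passwd/group style lines
    whose field at index `field` equals wanted_id exactly."""
    names = []
    for line in lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        parts = line.split(":")
        if len(parts) > field and parts[field] == str(wanted_id):
            names.append(parts[0])
    return names


# Sample data taken from the egrep output quoted above.
passwd_lines = [
    "nfsnobody:x:65534:65534:Anonymous NFS User:/var/lib/nfs:/sbin/nologin",
    "hacluster:x:17:65:High Availability User:/home/hacluster:/bin/bash",
]
group_lines = [
    "nfsnobody:x:65534:",
    "haclient:x:65:",
]

# The uid is field 2 in passwd lines; the gid is field 2 in group lines.
print(entries_with_id(passwd_lines, 17, 2))
print(entries_with_id(group_lines, 65, 2))
```

With the lines you posted, only hacluster (uid 17) and haclient (gid 65)
should come back, which agrees with the "uid 17 gid 65" in the syslog
message.
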
> I'm pretty sure this worked in the past (before version 2.0.7, which
> is the version currently in use). The heartbeat package is installed
> from source and is running on RHEL3.
>
> Previous list posts suggest this is a permission problem. I've changed
> the perms on /var/run/heartbeat, the binary for ipfail and even the fifo
> in /var/lib/heartbeat, but I just feel I'm stabbing in the dark.
>
> Any suggestions would be greatly welcomed.
Strange. It must be a permission problem. Could you turn debugging
on and post the results? Does it help if you start ipfail as root?
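
To compare ownership and modes with the hacluster/haclient ids, a
small script along these lines might help. It is only a sketch: it
assumes a Python 3 interpreter is available (a stock RHEL3 box won't
have one), the function names describe and report are made up, and the
paths are just the ones mentioned in your mail. With a source install
the run-time directories may actually live under your install prefix,
so adjust PATHS as needed.

```python
import grp
import os
import pwd
import stat


def describe(path):
    """Return 'mode owner:group path', or 'path: error' if stat fails."""
    try:
        st = os.lstat(path)
    except OSError as exc:
        return "%s: %s" % (path, exc.strerror)
    try:
        owner = pwd.getpwuid(st.st_uid).pw_name
    except KeyError:
        owner = str(st.st_uid)  # uid has no passwd entry
    try:
        group = grp.getgrgid(st.st_gid).gr_name
    except KeyError:
        group = str(st.st_gid)  # gid has no group entry
    return "%s %s:%s %s" % (stat.filemode(st.st_mode), owner, group, path)


def report(path):
    """Describe path and, if it is a directory, its direct children."""
    lines = [describe(path)]
    if os.path.isdir(path):
        try:
            names = sorted(os.listdir(path))
        except OSError:
            names = []  # directory not readable by the current user
        lines.extend(describe(os.path.join(path, name)) for name in names)
    return lines


# Paths taken from the original mail; they are assumptions about where
# this particular install keeps its sockets, fifo and binaries.
PATHS = [
    "/var/run/heartbeat",
    "/var/lib/heartbeat",
    "/usr/local/packages/heartbeat/lib/heartbeat/ipfail",
]

for p in PATHS:
    for line in report(p):
        print(line)
```

Running it once as root and once as hacluster, and posting both
outputs, should show whether anything under those directories is not
accessible to uid 17 / gid 65.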
>
> Cheers,
>
> Matt
>
> _______________________________________________
> Linux-HA mailing list
> [email protected]
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems
--
Dejan