Dejan,

I started there. However, the problem I had was that I could not install 2.1.3 on Fedora Core 1 since it needed later versions of other RPMs. I can make 2.1.3 on FC1 but when I try to package heartbeat, I get missing libnet-devel, openhpi-devel, gnutls-devel, OpenIPMI-devel. Is there a way around this?

Gary

Dejan Muhamedagic wrote:
Hi,

On Fri, Jan 11, 2008 at 10:22:48AM -0500, Gary Schlachter wrote:
I have a problem with heartbeat dying. I have a 3 node cluster running HA 2.0.8 on Fedora Core 1. They are providing a single IP address resource. They are using eth0 as the heartbeat mechanism. If I disconnect the eth0 cable from the node which is providing the IP address, one of the other nodes correctly begins providing it. However, shortly after disconnecting the eth0 cable, the heartbeat process (and others) die. The

This has been fixed a few months ago. The fix is in the 2.1.3
release. Could you please use the new release.

Thanks,

Dejan

key area in the ha-debug log looks like the following:

pengine[4293]: 2008/01/11_09:50:22 info: determine_online_status: Node loneranger.us.big.net is online pengine[4293]: 2008/01/11_09:50:22 info: native_print: SharedIP (heartbeat::ocf:IPaddr): Started loneranger.us.big.net pengine[4293]: 2008/01/11_09:50:22 notice: StopRsc: loneranger.us.big.net Stop SharedIP crmd[9543]: 2008/01/11_09:50:22 info: do_state_transition: loneranger.us.big.net: State transition S_POLICY_ENGINE ->S_TRANSITION_ENGINE [input=I_PE_SUCCESS cause=C_IPC_MESSAGE origin=route_message ] pengine[4293]: 2008/01/11_09:50:22 info: process_pe_message: Transition 0: PEngine Input stored in: /var/lib/heartbeat/pengine/pe-input-137.bz2 tengine[4292]: 2008/01/11_09:50:22 info: unpack_graph: Unpacked transition 0: 1 actions in 1 synapses tengine[4292]: 2008/01/11_09:50:22 info: send_rsc_command: Initiating action 3: SharedIP_stop_0 on loneranger.us.big.net crmd[9543]: 2008/01/11_09:50:22 info: do_lrm_rsc_op: Performing op=SharedIP_stop_0 key=3:0:994066a9-4cae-49a4-abad-37f3e0b84b3e) IPaddr[4300]: 2008/01/11_09:50:22 INFO: /sbin/ifconfig eth0:0 10.1.2.50 down lrmd[9540]: 2008/01/11_09:50:22 info: RA output: (SharedIP:stop:stderr) SIOCDELRT: No such process

crmd[9543]: 2008/01/11_09:50:22 info: process_lrm_event: LRM operation SharedIP_stop_0 (call=4, rc=0) complete cib[9539]: 2008/01/11_09:50:22 info: cib_diff_notify: Update (client: 9543, call:32): 0.30.317 -> 0.30.318 (ok) cib[4315]: 2008/01/11_09:50:22 info: write_cib_contents: Wrote version 0.30.318 of the CIB to disk (digest: ad7329b3cddc6a9bbd96deb332a3d08f) tengine[4292]: 2008/01/11_09:50:22 info: te_update_diff: Processing diff (cib_update): 0.30.317 -> 0.30.318 tengine[4292]: 2008/01/11_09:50:22 info: match_graph_event: Action SharedIP_stop_0 (3) confirmed on c8608d41-66b2-4115-9043-4a8423b0d562 tengine[4292]: 2008/01/11_09:50:22 info: run_graph: Transition 0: (Complete=1, Pending=0, Fired=0, Skipped=0, Incomplete=0) tengine[4292]: 2008/01/11_09:50:22 info: notify_crmd: Transition 0 status: te_complete - <null> crmd[9543]: 2008/01/11_09:50:22 info: do_state_transition: loneranger.us.big.net: State transition S_TRANSITION_ENGINE -> S_IDLE [ input=I_TE_SUCCESS cause=C_IPC_MESSAGE origin=route_message ] heartbeat[9527]: 2008/01/11_09:54:27 ERROR: Cannot write to media pipe 0: Resource temporarily unavailable
heartbeat[9527]: 2008/01/11_09:54:27 ERROR: Shutting down.
heartbeat[9527]: 2008/01/11_09:54:27 ERROR: Cannot write to media pipe 0: Resource temporarily unavailable
heartbeat[9527]: 2008/01/11_09:54:27 ERROR: Shutting down.
heartbeat[9527]: 2008/01/11_09:54:27 ERROR: Cannot write to media pipe 0: Resource temporarily unavailable
heartbeat[9527]: 2008/01/11_09:54:27 ERROR: Shutting down.

The last messages repeat for a very long time then most daemons eventually stop.


_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Reply via email to