I am running a Pacemaker/Heartbeat cluster on Debian. Heartbeat is version 3.0.4-1 (from wheezy).

From my daemon.log:

Apr 17 17:07:07 s1 attrd: [2692]: info: ha_msg_dispatch: Lost connection to heartbeat service.
Apr 17 17:07:07 s1 stonithd: [2691]: info: ha_msg_dispatch: Lost connection to heartbeat service.
Apr 17 17:07:07 s1 crmd: [2693]: info: ha_msg_dispatch: Lost connection to heartbeat service.
Apr 17 17:07:07 s1 cib: [2689]: info: ha_msg_dispatch: Lost connection to heartbeat service.
Apr 17 17:07:07 s1 ccm: [2688]: ERROR: Lost connection to heartbeat service. Need to bail out.
Apr 17 17:07:07 s1 cib: [2689]: info: mem_handle_func:IPC broken, ccm is dead before the client!
Apr 17 17:07:07 s1 cib: [2689]: ERROR: cib_ccm_dispatch: CCM connection appears to have failed: rc=-1.
Apr 17 17:07:07 s1 cib: [2689]: ERROR: cib_ccm_dispatch: Exiting to recover from CCM connection failure
Apr 17 17:07:07 s1 attrd: [2692]: info: cib_native_msgready: Lost connection to the CIB service [2689].
Apr 17 17:07:07 s1 attrd: [2692]: CRIT: cib_native_dispatch: Lost connection to the CIB service [2689/callback].
Apr 17 17:07:07 s1 attrd: [2692]: CRIT: cib_native_dispatch: Lost connection to the CIB service [2689/command].
Apr 17 17:07:07 s1 crmd: [2693]: info: mem_handle_func:IPC broken, ccm is dead before the client!
Apr 17 17:07:07 s1 crmd: [2693]: info: cib_native_msgready: Lost connection to the CIB service [2689].
Apr 17 17:07:07 s1 crmd: [2693]: CRIT: cib_native_dispatch: Lost connection to the CIB service [2689/callback].
Apr 17 17:07:07 s1 crmd: [2693]: CRIT: cib_native_dispatch: Lost connection to the CIB service [2689/command].
Apr 17 17:07:07 s1 crmd: [2693]: ERROR: crmd_cib_connection_destroy: Connection to the CIB terminated...
Apr 17 17:07:07 s1 crmd: [2693]: ERROR: ccm_dispatch: CCM connection appears to have failed: rc=-1.
Apr 17 17:07:07 s1 attrd: [2692]: ERROR: attrd_cib_connection_destroy: Connection to the CIB terminated...
Apr 17 17:07:07 s1 crmd: [2693]: ERROR: do_log: FSA: Input I_ERROR from crmd_cib_connection_destroy() received in state S_NOT_DC
Apr 17 17:07:07 s1 crmd: [2693]: info: do_state_transition: State transition S_NOT_DC -> S_RECOVERY [ input=I_ERROR cause=C_FSA_INTERNAL origin=crmd_cib_connection_destroy ]
Apr 17 17:07:07 s1 crmd: [2693]: ERROR: do_recover: Action A_RECOVER (0000000001000000) not supported
Apr 17 17:07:07 s1 crmd: [2693]: ERROR: do_log: FSA: Input I_ERROR from ccm_dispatch() received in state S_RECOVERY
Apr 17 17:07:07 s1 crmd: [2693]: info: do_dc_release: DC role released
Apr 17 17:07:07 s1 crmd: [2693]: info: do_te_control: Transitioner is now inactive
Apr 17 17:07:07 s1 crmd: [2693]: ERROR: do_log: FSA: Input I_TERMINATE from do_recover() received in state S_RECOVERY
Apr 17 17:07:07 s1 crmd: [2693]: info: do_state_transition: State transition S_RECOVERY -> S_TERMINATE [ input=I_TERMINATE cause=C_FSA_INTERNAL origin=do_recover ]
Apr 17 17:07:07 s1 crmd: [2693]: info: do_shutdown: All subsystems stopped, continuing

The debug log showed some Pacemaker messages:

Apr 17 17:07:07 s1 attrd: [2692]: debug: xmlfromIPC: Peer disconnected
Apr 17 17:07:07 s1 crmd: [2693]: debug: xmlfromIPC: Peer disconnected
Apr 17 17:07:07 s1 crmd: [2693]: debug: s_crmd_fsa: Processing I_ERROR: [ state=S_NOT_DC cause=C_FSA_INTERNAL origin=crmd_cib_connection_destroy ]
Apr 17 17:07:07 s1 crmd: [2693]: debug: do_fsa_action: actions:trace: // A_ERROR
Apr 17 17:07:07 s1 crmd: [2693]: debug: do_fsa_action: actions:trace: // A_DC_TIMER_STOP
Apr 17 17:07:07 s1 crmd: [2693]: debug: do_fsa_action: actions:trace: // A_INTEGRATE_TIMER_STOP
Apr 17 17:07:07 s1 crmd: [2693]: debug: do_fsa_action: actions:trace: // A_FINALIZE_TIMER_STOP
Apr 17 17:07:07 s1 crmd: [2693]: debug: do_fsa_action: actions:trace: // A_RECOVER
Apr 17 17:07:07 s1 crmd: [2693]: debug: s_crmd_fsa: Processing I_ERROR: [ state=S_RECOVERY cause=C_CCM_CALLBACK origin=ccm_dispatch ]
Apr 17 17:07:07 s1 crmd: [2693]: debug: do_fsa_action: actions:trace: // A_ERROR
Apr 17 17:07:07 s1 crmd: [2693]: debug: do_fsa_action: actions:trace: // A_DC_TIMER_STOP
Apr 17 17:07:07 s1 crmd: [2693]: debug: do_fsa_action: actions:trace: // A_DC_RELEASE
Apr 17 17:07:07 s1 crmd: [2693]: debug: do_dc_release: Releasing the role of DC
Apr 17 17:07:07 s1 crmd: [2693]: debug: do_fsa_action: actions:trace: // A_DC_RELEASED
Apr 17 17:07:07 s1 crmd: [2693]: debug: do_fsa_action: actions:trace: // A_PE_STOP
Apr 17 17:07:07 s1 crmd: [2693]: debug: do_fsa_action: actions:trace: // A_TE_STOP
Apr 17 17:07:07 s1 crmd: [2693]: debug: cib_client_del_notify_callback: Removing callback for cib_diff_notify events
Apr 17 17:07:07 s1 crmd: [2693]: debug: s_crmd_fsa: Processing I_TERMINATE: [ state=S_RECOVERY cause=C_FSA_INTERNAL origin=do_recover ]
Apr 17 17:07:07 s1 crmd: [2693]: debug: do_fsa_action: actions:trace: // A_ERROR
Apr 17 17:07:07 s1 crmd: [2693]: debug: do_fsa_action: actions:trace: // A_DC_TIMER_STOP
Apr 17 17:07:07 s1 crmd: [2693]: debug: do_fsa_action: actions:trace: // A_INTEGRATE_TIMER_STOP
Apr 17 17:07:07 s1 crmd: [2693]: debug: do_fsa_action: actions:trace: // A_FINALIZE_TIMER_STOP
Apr 17 17:07:07 s1 crmd: [2693]: debug: do_fsa_action: actions:trace: // A_SHUTDOWN
Apr 17 17:07:07 s1 crmd: [2693]: debug: do_fsa_action: actions:trace: // A_LRM_DISCONNECT
Apr 17 17:07:07 s1 crmd: [2693]: debug: verify_stopped: Checking for active resources before exit

I have the core dump from the heartbeat process. Where should I send it?

Thanks for any help,
Mark P
