Hello everybody,

    Recently,I compiled the heartbeat with version 2.1.4 source tar ball.And
I make crm on.But when executed /etc/rc.d/init.d/heartbeat start,the
heartbeat couldn't work normally.I found these errors in below section.What
should I do to deal with these errors.
I think the configuration files are correct.Because it can work well on red
hat el5.Thank you very much!


attrd[1008]: 2009/05/26_10:47:09 info: main: Starting mainloop...
crmd[1009]: 2009/05/26_10:47:35 info: crm_timer_popped: Election Trigger
(I_DC_TIMEOUT) just popped!
crmd[1009]: 2009/05/26_10:47:35 WARN: do_log: [[FSA]] Input I_DC_TIMEOUT
from crm_timer_popped() received in state (S_PENDING)
crmd[1009]: 2009/05/26_10:47:35 info: do_state_transition: State transition
S_PENDING -> S_ELECTION [ input=I_DC_TIMEOUT cause=C_TIMER_POPPED
origin=crm_timer_popped ]
heartbeat[996]: 2009/05/26_10:47:35 ERROR: ipc_bufpool_update: magic number
in head does not match.Something very bad happened, abort now, farside pid
=1009
heartbeat[996]: 2009/05/26_10:47:35 ERROR: magic=7365636f, expected
value=abcd
heartbeat[996]: 2009/05/26_10:47:35 info: pool: refcount=2,
startpos=0x8ff1a28, currpos=0x8ff1fe7,consumepos=0x8ff1f88,
endpos=0x8ff2a10, size=4096
heartbeat[996]: 2009/05/26_10:47:35 info: nmsgs=1
heartbeat[996]: 2009/05/26_10:47:35 info: ipcmsg: msg_len=151037752,
msg_buf=(nil), msg_body=(nil),msg_done=(nil), msg_private=0x3ed,
msg_ch=0x8fecad8
ccm[1004]: 2009/05/26_10:47:35 ERROR: Lost connection to heartbeat service.
Need to bail out.
crmd[1009]: 2009/05/26_10:47:35 CRIT: crmd_ha_msg_dispatch: Lost connection
to heartbeat service.
attrd[1008]: 2009/05/26_10:47:35 CRIT: attrd_ha_dispatch: Lost connection to
heartbeat service.
stonithd[1007]: 2009/05/26_10:47:35 ERROR: Disconnected with heartbeat
daemon
attrd[1008]: 2009/05/26_10:47:35 CRIT: attrd_ha_connection_destroy: Lost
connection to heartbeat service!
attrd[1008]: 2009/05/26_10:47:35 info: main: Exiting...
cib[1005]: 2009/05/26_10:47:35 ERROR: cib_ha_connection_destroy: Heartbeat
connection lost!  Exiting.
attrd[1008]: 2009/05/26_10:47:35 ERROR: attrd_cib_connection_destroy:
Connection to the CIB terminated...
crmd[1009]: 2009/05/26_10:47:35 info: mem_handle_func:IPC broken, ccm is
dead before the client!
stonithd[1007]: 2009/05/26_10:47:35 notice:
/root/heartbeat/lib/heartbeat/stonithd normally quit.
cib[1005]: 2009/05/26_10:47:35 ERROR: crm_abort: main: Triggered assert at
main.c:214 : g_hash_table_size(client_list) == 0
crmd[1009]: 2009/05/26_10:47:35 ERROR: ccm_dispatch: CCM connection appears
to have failed: rc=-1.
cib[1005]: 2009/05/26_10:47:35 WARN: main: Not all clients gone at exit
crmd[1009]: 2009/05/26_10:47:35 ERROR: do_log: [[FSA]] Input I_ERROR from
ccm_dispatch() received in state (S_ELECTION)
crmd[1009]: 2009/05/26_10:47:35 info: do_state_transition: State transition
S_ELECTION -> S_RECOVERY [ input=I_ERROR cause=C_CCM_CALLBACK
origin=ccm_dispatch ]
cib[1005]: 2009/05/26_10:47:35 info: main: Done
crmd[1009]: 2009/05/26_10:47:35 ERROR: do_recover: Action A_RECOVER
(0000000001000000) not supported
crmd[1009]: 2009/05/26_10:47:35 info: do_dc_release: DC role released
crmd[1009]: 2009/05/26_10:47:35 ERROR: do_log: [[FSA]] Input I_TERMINATE
from do_recover() received in state (S_RECOVERY)
crmd[1009]: 2009/05/26_10:47:35 info: do_state_transition: State transition
S_RECOVERY -> S_TERMINATE [ input=I_TERMINATE cause=C_FSA_INTERNAL
origin=do_recover ]
crmd[1009]: 2009/05/26_10:47:35 info: do_shutdown: All subsystems stopped,
continuing
crmd[1009]: 2009/05/26_10:47:35 info: do_lrm_control: Disconnected from the
LRM
crmd[1009]: 2009/05/26_10:47:35 info: do_ha_control: Disconnected from
Heartbeat
crmd[1009]: 2009/05/26_10:47:35 info: do_cib_control: Disconnecting CIB
crmd[1009]: 2009/05/26_10:47:35 ERROR: send_ipc_message: IPC Channel to 1005
is not connected
crmd[1009]: 2009/05/26_10:47:35 WARN: crm_log_message_adv: #=========
IPC[outbound] message start ==========#
crmd[1009]: 2009/05/26_10:47:35 WARN: MSG: Dumping message with 5 fields
crmd[1009]: 2009/05/26_10:47:35 WARN: MSG[0] : [__name__=cib_command]
crmd[1009]: 2009/05/26_10:47:35 WARN: MSG[1] : [t=cib]
crmd[1009]: 2009/05/26_10:47:35 WARN: MSG[2] : [cib_op=cib_slave]
crmd[1009]: 2009/05/26_10:47:35 WARN: MSG[3] : [cib_callid=9]
crmd[1009]: 2009/05/26_10:47:35 WARN: MSG[4] : [cib_callopt=256]
crmd[1009]: 2009/05/26_10:47:35 ERROR: cib_native_perform_op: Sending
message to CIB service FAILED
crmd[1009]: 2009/05/26_10:47:35 info: crmd_cib_connection_destroy:
Connection to the CIB terminated...
crmd[1009]: 2009/05/26_10:47:35 info: do_exit: Performing A_EXIT_0 -
gracefully exiting the CRMd
crmd[1009]: 2009/05/26_10:47:35 ERROR: do_exit: Could not recover from
internal error
crmd[1009]: 2009/05/26_10:47:35 info: free_mem: Dropping I_RELEASE_SUCCESS:
[ state=S_TERMINATE cause=C_FSA_INTERNAL origin=do_dc_release ]
crmd[1009]: 2009/05/26_10:47:35 info: free_mem: Dropping I_TERMINATE: [
state=S_TERMINATE cause=C_FSA_INTERNAL origin=do_stop ]
crmd[1009]: 2009/05/26_10:47:35 info: do_exit: [crmd] stopped (2)
heartbeat[999]: 2009/05/26_10:47:36 CRIT: Emergency Shutdown: Master Control
process died.
heartbeat[999]: 2009/05/26_10:47:36 CRIT: Killing pid 996 with SIGTERM
heartbeat[999]: 2009/05/26_10:47:36 CRIT: Killing pid 1000 with SIGTERM
heartbeat[999]: 2009/05/26_10:47:36 CRIT: Killing pid 1001 with SIGTERM
heartbeat[999]: 2009/05/26_10:47:36 CRIT: Emergency Shutdown(MCP dead):
Killing ourselves.


By the way,I found this file haresources2cib.py,en,has a bug.My
haresources's content is :143 IPaddr::192.168.9.153/24/eth0 test
When I run the script,and check the cib.xml.The content in the cib.xml is
incorrect.In second line and third line.That would be reversal of the 24 and
eth0.

auto generated by haresources2cib:
<nvpair id="IPaddr_192_168_9_153_attr_0" name="ip" value="192.168.9.153"/>
<nvpair id="IPaddr_192_168_9_153_attr_1" name="nic" value="24"/>
<nvpair id="IPaddr_192_168_9_153_attr_2" name="cidr_netmask" value="eth0"/>
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Reply via email to