> > When the lrmd process falls, lrmd reboots. > > But, the monitor stops after having rebooted. > > > > In this status, lrmd cannot detect the obstacle of the resource after it. > > > > Actually, there may be little possibility that lrmd reboots. > > But, I think that it is necessary when I think about the worst case. > > > > When it is necessary for lrmd to restart, I hope that a system reboots. > > Like ccm and crmd .... > > (Possibly a similar thing may apply to a stonithd process.) > > > > Will it be difficult to hope for this improvement in Heartbeat? > > > > Best Regards, > > Hideo Yamauchi. > > Hi, > > on Yamauchi-san's test environment, lrmd would restart if it's killed (# > kill -9 PID). > for mine, lrmd is sure to restart, but crmd catches some critical error and > end up to system reboot due to the suicide code. > see attached node-a/ha-log: around line 171
Sorry for missing attached. > > * kill -9 lrmd process > heartbeat[28075]: 2008/06/17_16:48:51 WARN: Managed > /usr/lib64/heartbeat/lrmd -r process 28086 killed by signal 9 [SIGKILL - > Kill, unblockable]. > > * crmd lose the connection with lrm > crmd[28089]: 2008/06/17_16:48:51 CRIT: lrm_dispatch: LRM Connection failed > > If you remember something, please let me know. > The version of Heartbeat, OS, glib are the same. > Can anything make an impact? > > Which is the appropriate behavior for heartbeat when lrmd is killed; > restart lrmd or system reboot? > > Thanks, > Junko > > _______________________________________________ > Linux-HA mailing list > [email protected] > http://lists.linux-ha.org/mailman/listinfo/linux-ha > See also: http://linux-ha.org/ReportingProblems
hb_report.tar.gz
Description: Binary data
# gdb /usr/lib64/heartbeat/crmd 27724
GNU gdb Red Hat Linux (6.5-25.el5rh)
Copyright (C) 2006 Free Software Foundation, Inc.
GDB is free software, covered by the GNU General Public License, and you are
welcome to change it and/or distribute copies of it under certain conditions.
Type "show copying" to see the conditions.
There is absolutely no warranty for GDB. Type "show warranty" for details.
This GDB was configured as "x86_64-redhat-linux-gnu"...Using host libthread_db
library "/lib64/libthread_db.so.1".
Attaching to program: /usr/lib64/heartbeat/crmd, process 27724
Reading symbols from /usr/lib64/libplumb.so.1...done.
Loaded symbols for /usr/lib64/libplumb.so.1
Reading symbols from /usr/lib64/libcrmcommon.so.1...done.
Loaded symbols for /usr/lib64/libcrmcommon.so.1
Reading symbols from /usr/lib64/libpe_rules.so.2...done.
Loaded symbols for /usr/lib64/libpe_rules.so.2
Reading symbols from /usr/lib64/libcib.so.1...done.
Loaded symbols for /usr/lib64/libcib.so.1
Reading symbols from /usr/lib64/libhbclient.so.1...done.
Loaded symbols for /usr/lib64/libhbclient.so.1
Reading symbols from /usr/lib64/liblrm.so.0...done.
Loaded symbols for /usr/lib64/liblrm.so.0
Reading symbols from /usr/lib64/libpils.so.1...done.
Loaded symbols for /usr/lib64/libpils.so.1
Reading symbols from /lib64/libglib-2.0.so.0...done.
Loaded symbols for /lib64/libglib-2.0.so.0
Reading symbols from /usr/lib64/libccmclient.so.1...done.
Loaded symbols for /usr/lib64/libccmclient.so.1
Reading symbols from /usr/lib64/libbz2.so.1...done.
Loaded symbols for /usr/lib64/libbz2.so.1
Reading symbols from /usr/lib64/libz.so.1...done.
Loaded symbols for /usr/lib64/libz.so.1
Reading symbols from /usr/lib64/libxml2.so.2...done.
Loaded symbols for /usr/lib64/libxml2.so.2
Reading symbols from /lib64/libc.so.6...done.
Loaded symbols for /lib64/libc.so.6
Reading symbols from /lib64/libuuid.so.1...done.
Loaded symbols for /lib64/libuuid.so.1
Reading symbols from /lib64/libpam.so.0...done.
Loaded symbols for /lib64/libpam.so.0
Reading symbols from /lib64/librt.so.1...done.
Loaded symbols for /lib64/librt.so.1
Reading symbols from /usr/lib64/libltdl.so.3...done.
Loaded symbols for /usr/lib64/libltdl.so.3
Reading symbols from /lib64/libdl.so.2...done.
Loaded symbols for /lib64/libdl.so.2
Reading symbols from /lib64/libm.so.6...done.
Loaded symbols for /lib64/libm.so.6
Reading symbols from /lib64/ld-linux-x86-64.so.2...done.
Loaded symbols for /lib64/ld-linux-x86-64.so.2
Reading symbols from /lib64/libaudit.so.0...done.
Loaded symbols for /lib64/libaudit.so.0
Reading symbols from /lib64/libpthread.so.0...done.
[Thread debugging using libthread_db enabled]
[New Thread 46912513575664 (LWP 27724)]
Loaded symbols for /lib64/libpthread.so.0
Reading symbols from /lib64/libnss_files.so.2...done.
Loaded symbols for /lib64/libnss_files.so.2
Reading symbols from /usr/lib64/pils/plugins/InterfaceMgr/generic.so...done.
Loaded symbols for /usr/lib64/pils/plugins/InterfaceMgr/generic.so
Reading symbols from /usr/lib64/heartbeat/plugins/HBcompress/zlib.so...done.
Loaded symbols for /usr/lib64/heartbeat/plugins/HBcompress/zlib.so
0x0000003ad38c585f in poll () from /lib64/libc.so.6
(gdb) b lrm.c:87
Breakpoint 1 at 0x417dc0: file lrm.c, line 87.
(gdb) c
Continuing.
[Switching to Thread 46912513575664 (LWP 27724)]
Breakpoint 1, do_lrm_control (action=144115188075855872, cause=C_FSA_INTERNAL,
cur_state=S_TERMINATE, current_input=I_TERMINATE,
msg_data=0x16431498) at lrm.c:92
92 {
(gdb) bt
#0 do_lrm_control (action=144115188075855872, cause=C_FSA_INTERNAL,
cur_state=S_TERMINATE, current_input=I_TERMINATE,
msg_data=0x16431498) at lrm.c:92
#1 0x0000000000405245 in do_fsa_action (fsa_data=0x16431498,
an_action=144115188075855872, function=0x417dc0 <do_lrm_control>)
at fsa.c:176
#2 0x0000000000405f95 in s_crmd_fsa_actions (fsa_data=0x16431498) at fsa.c:526
#3 0x0000000000406267 in s_crmd_fsa (cause=C_FSA_INTERNAL) at fsa.c:311
#4 0x000000000040cd57 in crm_fsa_trigger (user_data=<value optimized out>) at
callbacks.c:654
#5 0x00002aaaaaadd488 in G_TRIG_dispatch (source=<value optimized out>,
callback=<value optimized out>, user_data=<value optimized out>)
at GSource.c:1388
#6 0x0000003ad502cf44 in g_main_context_dispatch () from
/lib64/libglib-2.0.so.0
#7 0x0000003ad502fd7d in g_main_context_check () from /lib64/libglib-2.0.so.0
#8 0x0000003ad503008a in g_main_loop_run () from /lib64/libglib-2.0.so.0
#9 0x0000000000403e37 in crmd_init () at main.c:154
#10 0x0000003ad381d8a4 in __libc_start_main () from /lib64/libc.so.6
#11 0x0000000000403bd9 in _start ()
(gdb) quit
The program is running. Quit anyway (and detach it)? (y or n) y
Detaching from program: /usr/lib64/heartbeat/crmd, process 27724
[EMAIL PROTECTED] crmd]# clear
[EMAIL PROTECTED] crmd]# gdb /usr/lib64/heartbeat/crmd 28089
GNU gdb Red Hat Linux (6.5-25.el5rh)
Copyright (C) 2006 Free Software Foundation, Inc.
GDB is free software, covered by the GNU General Public License, and you are
welcome to change it and/or distribute copies of it under certain conditions.
Type "show copying" to see the conditions.
There is absolutely no warranty for GDB. Type "show warranty" for details.
This GDB was configured as "x86_64-redhat-linux-gnu"...Using host libthread_db
library "/lib64/libthread_db.so.1".
Attaching to program: /usr/lib64/heartbeat/crmd, process 28089
Reading symbols from /usr/lib64/libplumb.so.1...done.
Loaded symbols for /usr/lib64/libplumb.so.1
Reading symbols from /usr/lib64/libcrmcommon.so.1...done.
Loaded symbols for /usr/lib64/libcrmcommon.so.1
Reading symbols from /usr/lib64/libpe_rules.so.2...done.
Loaded symbols for /usr/lib64/libpe_rules.so.2
Reading symbols from /usr/lib64/libcib.so.1...done.
Loaded symbols for /usr/lib64/libcib.so.1
Reading symbols from /usr/lib64/libhbclient.so.1...done.
Loaded symbols for /usr/lib64/libhbclient.so.1
Reading symbols from /usr/lib64/liblrm.so.0...done.
Loaded symbols for /usr/lib64/liblrm.so.0
Reading symbols from /usr/lib64/libpils.so.1...done.
Loaded symbols for /usr/lib64/libpils.so.1
Reading symbols from /lib64/libglib-2.0.so.0...done.
Loaded symbols for /lib64/libglib-2.0.so.0
Reading symbols from /usr/lib64/libccmclient.so.1...done.
Loaded symbols for /usr/lib64/libccmclient.so.1
Reading symbols from /usr/lib64/libbz2.so.1...done.
Loaded symbols for /usr/lib64/libbz2.so.1
Reading symbols from /usr/lib64/libz.so.1...done.
Loaded symbols for /usr/lib64/libz.so.1
Reading symbols from /usr/lib64/libxml2.so.2...done.
Loaded symbols for /usr/lib64/libxml2.so.2
Reading symbols from /lib64/libc.so.6...done.
Loaded symbols for /lib64/libc.so.6
Reading symbols from /lib64/libuuid.so.1...done.
Loaded symbols for /lib64/libuuid.so.1
Reading symbols from /lib64/libpam.so.0...done.
Loaded symbols for /lib64/libpam.so.0
Reading symbols from /lib64/librt.so.1...done.
Loaded symbols for /lib64/librt.so.1
Reading symbols from /usr/lib64/libltdl.so.3...done.
Loaded symbols for /usr/lib64/libltdl.so.3
Reading symbols from /lib64/libdl.so.2...done.
Loaded symbols for /lib64/libdl.so.2
Reading symbols from /lib64/libm.so.6...done.
Loaded symbols for /lib64/libm.so.6
Reading symbols from /lib64/ld-linux-x86-64.so.2...done.
Loaded symbols for /lib64/ld-linux-x86-64.so.2
Reading symbols from /lib64/libaudit.so.0...done.
Loaded symbols for /lib64/libaudit.so.0
Reading symbols from /lib64/libpthread.so.0...done.
[Thread debugging using libthread_db enabled]
[New Thread 46912513575664 (LWP 28089)]
Loaded symbols for /lib64/libpthread.so.0
Reading symbols from /lib64/libnss_files.so.2...done.
Loaded symbols for /lib64/libnss_files.so.2
Reading symbols from /usr/lib64/pils/plugins/InterfaceMgr/generic.so...done.
Loaded symbols for /usr/lib64/pils/plugins/InterfaceMgr/generic.so
Reading symbols from /usr/lib64/heartbeat/plugins/HBcompress/zlib.so...done.
Loaded symbols for /usr/lib64/heartbeat/plugins/HBcompress/zlib.so
0x0000003ad38c585f in poll () from /lib64/libc.so.6
(gdb) b lrm.c:87
Breakpoint 1 at 0x417dc0: file lrm.c, line 87.
(gdb) c
Continuing.
[Switching to Thread 46912513575664 (LWP 28089)]
Breakpoint 1, do_lrm_control (action=144115188075855872, cause=C_FSA_INTERNAL,
cur_state=S_TERMINATE, current_input=I_TERMINATE,
msg_data=0x198f3518) at lrm.c:92
92 {
(gdb) bt
#0 do_lrm_control (action=144115188075855872, cause=C_FSA_INTERNAL,
cur_state=S_TERMINATE, current_input=I_TERMINATE,
msg_data=0x198f3518) at lrm.c:92
#1 0x0000000000405245 in do_fsa_action (fsa_data=0x198f3518,
an_action=144115188075855872, function=0x417dc0 <do_lrm_control>)
at fsa.c:176
#2 0x0000000000405f95 in s_crmd_fsa_actions (fsa_data=0x198f3518) at fsa.c:526
#3 0x0000000000406267 in s_crmd_fsa (cause=C_FSA_INTERNAL) at fsa.c:311
#4 0x000000000040cd57 in crm_fsa_trigger (user_data=<value optimized out>) at
callbacks.c:654
#5 0x00002aaaaaadd488 in G_TRIG_dispatch (source=<value optimized out>,
callback=<value optimized out>, user_data=<value optimized out>)
at GSource.c:1388
#6 0x0000003ad502cf44 in g_main_context_dispatch () from
/lib64/libglib-2.0.so.0
#7 0x0000003ad502fd7d in g_main_context_check () from /lib64/libglib-2.0.so.0
#8 0x0000003ad503008a in g_main_loop_run () from /lib64/libglib-2.0.so.0
#9 0x0000000000403e37 in crmd_init () at main.c:154
#10 0x0000003ad381d8a4 in __libc_start_main () from /lib64/libc.so.6
#11 0x0000000000403bd9 in _start ()
(gdb)
_______________________________________________ Linux-HA mailing list [email protected] http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
