- **Milestone**: 4.4.2 --> 4.6.2
---
**[tickets:#1343] CLM: clmd asserted when controller switchover is invoked with CLM shutdown operation of node**
**Status:** unassigned
**Milestone:** 4.6.2
**Created:** Sun Apr 26, 2015 05:03 PM UTC by Srikanth R
**Last Updated:** Mon Apr 27, 2015 04:08 AM UTC
**Owner:** nobody
Changeset : 6377
Steps performed :
-> Issued an admin shutdown operation on member node PL-5 and ensured that the
CLM agent did not respond to the start callback.
426 16:51:12 04/26/2015 NO safApp=safClmService
"safNode=PL-5,safCluster=myClmCluster Admin State Changed, new
state=SHUTTING_DOWN"
-> Invoked a controller switchover by issuing an admin si-swap operation.
427 16:51:20 04/26/2015 NO safApp=safAmfService "Admin op "SI_SWAP"
initiated for 'safSi=SC-2N,safApp=OpenSAF', invocation: 502511173633"
428 16:51:20 04/26/2015 NO safApp=safAmfService
"safSi=SC-2N,safApp=OpenSAF Swap initiated"
-> clmd asserted on the quiesced controller.
Apr 26 16:51:20 CONTROLLER-1 osafamfd[2119]: NO safSi=SC-2N,safApp=OpenSAF Swap
initiated
Apr 26 16:51:20 CONTROLLER-1 osafamfnd[2129]: NO Assigning
'safSi=SC-2N,safApp=OpenSAF' QUIESCED to 'safSu=SC-1,safSg=2N,safApp=OpenSAF'
Apr 26 16:51:20 CONTROLLER-1 osafimmnd[2063]: NO Implementer locally
disconnected. Marking it as doomed 80 <604, 2010f> (safSmfService)
Apr 26 16:51:20 CONTROLLER-1 osafimmnd[2063]: NO Implementer disconnected 75
<332, 2010f> (safMsgGrpService)
Apr 26 16:51:20 CONTROLLER-1 osafimmnd[2063]: NO Implementer disconnected 80
<604, 2010f> (safSmfService)
Apr 26 16:51:20 CONTROLLER-1 osafimmnd[2063]: NO Implementer disconnected 72
<3, 2010f> (safLogService)
Apr 26 16:51:20 CONTROLLER-1 osafimmnd[2063]: NO Implementer disconnected 78
<334, 2010f> (safEvtService)
Apr 26 16:51:20 CONTROLLER-1 osafamfnd[2129]: NO
'safComp=CLM,safSu=SC-1,safSg=2N,safApp=OpenSAF' faulted due to 'avaDown' :
Recovery is 'nodeFailfast'
Apr 26 16:51:20 CONTROLLER-1 osafamfnd[2129]: ER
safComp=CLM,safSu=SC-1,safSg=2N,safApp=OpenSAF Faulted due to:avaDown Recovery
is:nodeFailfast
Apr 26 16:51:20 CONTROLLER-1 osafamfnd[2129]: Rebooting OpenSAF NodeId = 131343
EE Name = , Reason: Component faulted: recovery is node failfast, OwnNodeId =
131343, SupervisionTime = 60
-> Below is the backtrace :
(gdb) thread apply all bt
Thread 4 (Thread 0x7f4caf033700 (LWP 2100)):
#0 0x00007f4cadcd7415 in __lll_unlock_wake () from /lib64/libpthread.so.0
#1 0x00007f4cadcd3ac4 in _L_unlock_553 () from /lib64/libpthread.so.0
#2 0x00007f4cadcd39f7 in __pthread_mutex_unlock_usercnt () from
/lib64/libpthread.so.0
#3 0x00007f4caec06870 in ncsmds_adm_api () from
/usr/lib64/libopensaf_core.so.0
#4 0x00007f4caec1f813 in vda_chg_role_vdest () from
/usr/lib64/libopensaf_core.so.0
#5 0x00007f4caec1ed79 in ncsvda_api () from /usr/lib64/libopensaf_core.so.0
#6 0x000000000041e6ec in clms_mds_change_role ()
#7 0x0000000000404617 in amf_quiesced_state_handler ()
#8 0x0000000000404778 in clms_amf_csi_set_callback ()
#9 0x00007f4cae9a5ba0 in ava_hdl_cbk_rec_prc () from /usr/lib64/libSaAmf.so.0
#10 0x00007f4cae9a530d in ava_hdl_cbk_dispatch_all () from
/usr/lib64/libSaAmf.so.0
#11 0x00007f4cae9a4e34 in ava_hdl_cbk_dispatch () from /usr/lib64/libSaAmf.so.0
#12 0x00007f4cae99df14 in saAmfDispatch () at ava_api.c:261
#13 0x0000000000411032 in main ()
Thread 3 (Thread 0x7f4caf010b00 (LWP 2104)):
#0 0x00007f4cad6164f6 in poll () from /lib64/libc.so.6
#1 0x00007f4caebd0df1 in osaf_ppoll () from /usr/lib64/libopensaf_core.so.0
#2 0x00007f4caebd0d27 in osaf_poll () from /usr/lib64/libopensaf_core.so.0
#3 0x00007f4caebd0ef0 in osaf_poll_one_fd () from
/usr/lib64/libopensaf_core.so.0
#4 0x00007f4cadee7a04 in rda_read_msg () from /usr/lib64/opensaf/librda.so.0
#5 0x00007f4cadee71e7 in rda_callback_task () from
/usr/lib64/opensaf/librda.so.0
#6 0x00007f4cadcd07b6 in start_thread () from /lib64/libpthread.so.0
#7 0x00007f4cad61f9cd in clone () from /lib64/libc.so.6
#8 0x0000000000000000 in ?? ()
Thread 2 (Thread 0x7f4caf062b00 (LWP 2102)):
#0 0x00007f4cad6164f6 in poll () from /lib64/libc.so.6
#1 0x00007f4caebd0df1 in osaf_ppoll () from /usr/lib64/libopensaf_core.so.0
#2 0x00007f4caebda7b5 in ncs_tmr_wait () from /usr/lib64/libopensaf_core.so.0
#3 0x00007f4cadcd07b6 in start_thread () from /lib64/libpthread.so.0
#4 0x00007f4cad61f9cd in clone () from /lib64/libc.so.6
#5 0x0000000000000000 in ?? ()
Thread 1 (Thread 0x7f4caf030b00 (LWP 2103)):
#0 0x00007f4cad57ab55 in raise () from /lib64/libc.so.6
#1 0x00007f4cad57c131 in abort () from /lib64/libc.so.6
#2 0x00007f4cad5b7c2f in __libc_message () from /lib64/libc.so.6
#3 0x00007f4cad5bd358 in malloc_printerr () from /lib64/libc.so.6
#4 0x00007f4cad5c099d in _int_malloc () from /lib64/libc.so.6
#5 0x00007f4cad5c23e7 in malloc () from /lib64/libc.so.6
#6 0x00007f4caec0362a in mds_subtn_res_tbl_remove_active () from
/usr/lib64/libopensaf_core.so.0
#7 0x00007f4caec196d1 in mds_mcm_svc_up () from
/usr/lib64/libopensaf_core.so.0
#8 0x00007f4caec21860 in mdtm_process_discovery_events () from
/usr/lib64/libopensaf_core.so.0
#9 0x00007f4caec20cb5 in mdtm_process_recv_events () from
/usr/lib64/libopensaf_core.so.0
#10 0x00007f4cadcd07b6 in start_thread () from /lib64/libpthread.so.0
#11 0x00007f4cad61f9cd in clone () from /lib64/libc.so.6
#12 0x0000000000000000 in ?? ()
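
For triage, the aborting thread can be picked out of a `thread apply all bt` dump mechanically. The following is a minimal sketch, not part of OpenSAF: the helper names (`parse_backtrace`, `aborting_threads`, `caller_of`) and the regular expressions are assumptions made for illustration, and the sample text is a trimmed excerpt of the backtrace above.

```python
import re

# Patterns assumed from the gdb output format shown in this ticket.
THREAD_RE = re.compile(r"^Thread (\d+) \(Thread 0x[0-9a-f]+ \(LWP (\d+)\)\):")
FRAME_RE = re.compile(r"^#(\d+)\s+(?:0x[0-9a-f]+ in )?(\S+) \(")

SAMPLE = """\
Thread 4 (Thread 0x7f4caf033700 (LWP 2100)):
#0 0x00007f4cadcd7415 in __lll_unlock_wake () from /lib64/libpthread.so.0
#6 0x000000000041e6ec in clms_mds_change_role ()
Thread 1 (Thread 0x7f4caf030b00 (LWP 2103)):
#0 0x00007f4cad57ab55 in raise () from /lib64/libc.so.6
#1 0x00007f4cad57c131 in abort () from /lib64/libc.so.6
#2 0x00007f4cad5b7c2f in __libc_message () from /lib64/libc.so.6
#3 0x00007f4cad5bd358 in malloc_printerr () from /lib64/libc.so.6
#4 0x00007f4cad5c099d in _int_malloc () from /lib64/libc.so.6
#5 0x00007f4cad5c23e7 in malloc () from /lib64/libc.so.6
#6 0x00007f4caec0362a in mds_subtn_res_tbl_remove_active () from
/usr/lib64/libopensaf_core.so.0
"""


def parse_backtrace(text):
    """Return {thread_id: [function names, innermost frame first]}."""
    threads = {}
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        m = THREAD_RE.match(line)
        if m:
            current = int(m.group(1))
            threads[current] = []
            continue
        m = FRAME_RE.match(line)
        if m and current is not None:
            threads[current].append(m.group(2))
        # Wrapped continuation lines (library paths) match neither pattern
        # and are skipped.
    return threads


def aborting_threads(threads):
    """Thread ids whose stack contains a call to abort()."""
    return sorted(t for t, frames in threads.items() if "abort" in frames)


def caller_of(frames, name):
    """Function that called `name`, i.e. the next frame outward, or None."""
    if name not in frames:
        return None
    idx = frames.index(name) + 1
    return frames[idx] if idx < len(frames) else None


if __name__ == "__main__":
    threads = parse_backtrace(SAMPLE)
    for tid in aborting_threads(threads):
        print(tid, caller_of(threads[tid], "malloc"))
```

On the excerpt, this reports thread 1 as the aborting thread, with `mds_subtn_res_tbl_remove_active` as the caller of `malloc`. In the full dump, thread 4 is inside `clms_mds_change_role` at the same time.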