- **Milestone**: 4.4.2 --> 4.6.2


---

** [tickets:#1343] CLM : clmd asserted when controller switchover is invoked 
with CLM shutdown operation of node**

**Status:** unassigned
**Milestone:** 4.6.2
**Created:** Sun Apr 26, 2015 05:03 PM UTC by Srikanth R
**Last Updated:** Mon Apr 27, 2015 04:08 AM UTC
**Owner:** nobody


Changeset : 6377

Steps performed :


 -> Issued admin shutdown operation on a member node PL-5 and ensured the CLM 
agent did not respond in the start callback 

       426 16:51:12 04/26/2015 NO safApp=safClmService 
"safNode=PL-5,safCluster=myClmCluster Admin State Changed, new 
state=SHUTTING_DOWN"

 -> Invoked controller switchover by issuing admin si-swap operation.

       427 16:51:20 04/26/2015 NO safApp=safAmfService "Admin op "SI_SWAP" 
initiated for 'safSi=SC-2N,safApp=OpenSAF', invocation: 502511173633"
       428 16:51:20 04/26/2015 NO safApp=safAmfService 
"safSi=SC-2N,safApp=OpenSAF Swap initiated"

 -> clmd asserted on the quiesced controller.

Apr 26 16:51:20 CONTROLLER-1 osafamfd[2119]: NO safSi=SC-2N,safApp=OpenSAF Swap 
initiated
Apr 26 16:51:20 CONTROLLER-1 osafamfnd[2129]: NO Assigning 
'safSi=SC-2N,safApp=OpenSAF' QUIESCED to 'safSu=SC-1,safSg=2N,safApp=OpenSAF'
Apr 26 16:51:20 CONTROLLER-1 osafimmnd[2063]: NO Implementer locally 
disconnected. Marking it as doomed 80 <604, 2010f> (safSmfService)
Apr 26 16:51:20 CONTROLLER-1 osafimmnd[2063]: NO Implementer disconnected 75 
<332, 2010f> (safMsgGrpService)
Apr 26 16:51:20 CONTROLLER-1 osafimmnd[2063]: NO Implementer disconnected 80 
<604, 2010f> (safSmfService)
Apr 26 16:51:20 CONTROLLER-1 osafimmnd[2063]: NO Implementer disconnected 72 
<3, 2010f> (safLogService)
Apr 26 16:51:20 CONTROLLER-1 osafimmnd[2063]: NO Implementer disconnected 78 
<334, 2010f> (safEvtService)
Apr 26 16:51:20 CONTROLLER-1 osafamfnd[2129]: NO 
'safComp=CLM,safSu=SC-1,safSg=2N,safApp=OpenSAF' faulted due to 'avaDown' : 
Recovery is 'nodeFailfast'
Apr 26 16:51:20 CONTROLLER-1 osafamfnd[2129]: ER 
safComp=CLM,safSu=SC-1,safSg=2N,safApp=OpenSAF Faulted due to:avaDown Recovery 
is:nodeFailfast
Apr 26 16:51:20 CONTROLLER-1 osafamfnd[2129]: Rebooting OpenSAF NodeId = 131343 
EE Name = , Reason: Component faulted: recovery is node failfast, OwnNodeId = 
131343, SupervisionTime = 60



 -> Below is the backtrace :

 (gdb) thread apply all bt
 
 Thread 4 (Thread 0x7f4caf033700 (LWP 2100)):
 #0  0x00007f4cadcd7415 in __lll_unlock_wake () from /lib64/libpthread.so.0
 #1  0x00007f4cadcd3ac4 in _L_unlock_553 () from /lib64/libpthread.so.0
 #2  0x00007f4cadcd39f7 in __pthread_mutex_unlock_usercnt () from 
/lib64/libpthread.so.0
 #3  0x00007f4caec06870 in ncsmds_adm_api () from 
/usr/lib64/libopensaf_core.so.0
 #4  0x00007f4caec1f813 in vda_chg_role_vdest () from 
/usr/lib64/libopensaf_core.so.0
 #5  0x00007f4caec1ed79 in ncsvda_api () from /usr/lib64/libopensaf_core.so.0
 #6  0x000000000041e6ec in clms_mds_change_role ()
 #7  0x0000000000404617 in amf_quiesced_state_handler ()
 #8  0x0000000000404778 in clms_amf_csi_set_callback ()
 #9  0x00007f4cae9a5ba0 in ava_hdl_cbk_rec_prc () from /usr/lib64/libSaAmf.so.0
 #10 0x00007f4cae9a530d in ava_hdl_cbk_dispatch_all () from 
/usr/lib64/libSaAmf.so.0
 #11 0x00007f4cae9a4e34 in ava_hdl_cbk_dispatch () from /usr/lib64/libSaAmf.so.0
 #12 0x00007f4cae99df14 in saAmfDispatch () at ava_api.c:261
 #13 0x0000000000411032 in main ()
 
 Thread 3 (Thread 0x7f4caf010b00 (LWP 2104)):
 #0  0x00007f4cad6164f6 in poll () from /lib64/libc.so.6
 #1  0x00007f4caebd0df1 in osaf_ppoll () from /usr/lib64/libopensaf_core.so.0
 #2  0x00007f4caebd0d27 in osaf_poll () from /usr/lib64/libopensaf_core.so.0
 #3  0x00007f4caebd0ef0 in osaf_poll_one_fd () from 
/usr/lib64/libopensaf_core.so.0
 #4  0x00007f4cadee7a04 in rda_read_msg () from /usr/lib64/opensaf/librda.so.0
 #5  0x00007f4cadee71e7 in rda_callback_task () from 
/usr/lib64/opensaf/librda.so.0
 #6  0x00007f4cadcd07b6 in start_thread () from /lib64/libpthread.so.0
 #7  0x00007f4cad61f9cd in clone () from /lib64/libc.so.6
 #8  0x0000000000000000 in ?? ()
 
 Thread 2 (Thread 0x7f4caf062b00 (LWP 2102)):
 #0  0x00007f4cad6164f6 in poll () from /lib64/libc.so.6
 #1  0x00007f4caebd0df1 in osaf_ppoll () from /usr/lib64/libopensaf_core.so.0
 #2  0x00007f4caebda7b5 in ncs_tmr_wait () from /usr/lib64/libopensaf_core.so.0
 #3  0x00007f4cadcd07b6 in start_thread () from /lib64/libpthread.so.0
 #4  0x00007f4cad61f9cd in clone () from /lib64/libc.so.6
 #5  0x0000000000000000 in ?? ()
 
 Thread 1 (Thread 0x7f4caf030b00 (LWP 2103)):
 #0  0x00007f4cad57ab55 in raise () from /lib64/libc.so.6
 #1  0x00007f4cad57c131 in abort () from /lib64/libc.so.6
 #2  0x00007f4cad5b7c2f in __libc_message () from /lib64/libc.so.6
 #3  0x00007f4cad5bd358 in malloc_printerr () from /lib64/libc.so.6
 #4  0x00007f4cad5c099d in _int_malloc () from /lib64/libc.so.6
 #5  0x00007f4cad5c23e7 in malloc () from /lib64/libc.so.6
 #6  0x00007f4caec0362a in mds_subtn_res_tbl_remove_active () from 
/usr/lib64/libopensaf_core.so.0
 ---Type <return> to continue, or q <return> to quit---
 #7  0x00007f4caec196d1 in mds_mcm_svc_up () from 
/usr/lib64/libopensaf_core.so.0
 #8  0x00007f4caec21860 in mdtm_process_discovery_events () from 
/usr/lib64/libopensaf_core.so.0
 #9  0x00007f4caec20cb5 in mdtm_process_recv_events () from 
/usr/lib64/libopensaf_core.so.0
 #10 0x00007f4cadcd07b6 in start_thread () from /lib64/libpthread.so.0
 #11 0x00007f4cad61f9cd in clone () from /lib64/libc.so.6
 #12 0x0000000000000000 in ?? ()








---

Sent from sourceforge.net because [email protected] is 
subscribed to https://sourceforge.net/p/opensaf/tickets/

To unsubscribe from further messages, a project admin can change settings at 
https://sourceforge.net/p/opensaf/admin/tickets/options.  Or, if this is a 
mailing list, you can unsubscribe from the mailing list.
------------------------------------------------------------------------------
_______________________________________________
Opensaf-tickets mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/opensaf-tickets

Reply via email to