[tickets] [opensaf:tickets] #2470 imm: IMMND coredump when discarding client

2017-05-28 Thread Hung Nguyen



---

**[tickets:#2470] imm: IMMND coredump when discarding client**

**Status:** accepted
**Milestone:** 5.17.06
**Created:** Mon May 29, 2017 04:05 AM UTC by Hung Nguyen
**Last Updated:** Mon May 29, 2017 04:05 AM UTC
**Owner:** Hung Nguyen


**mLastResult** of **ImmSearchOp** was freed twice: first in 
**immnd_evt_proc_search_next()** and later in 
**immnd_proc_imma_discard_connection()**.

In **immnd_evt_proc_search_next()**, **mLastResult** must be cleared before 
jumping to **agent_rsp**, so that it is not freed again in 
**immnd_proc_imma_discard_connection()**.
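
The free-then-clear pattern described above can be sketched as follows. This is a minimal illustration, not the actual IMMND code: `struct search_op`, `last_result`, `search_op_release_result()`, `search_next_error_path()` and `discard_connection()` are hypothetical stand-ins for **ImmSearchOp**, **mLastResult** and the two functions named in this ticket.

~~~c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>

/* Hypothetical stand-in for the search operation owning the cached result. */
struct search_op {
    char *last_result; /* plays the role of mLastResult */
};

/* Free the cached result and clear the pointer, so that any later
 * cleanup sees NULL instead of a dangling pointer. */
void search_op_release_result(struct search_op *op)
{
    assert(op != NULL);
    free(op->last_result);
    op->last_result = NULL;
}

/* Hypothetical error path of "search next": the result is released
 * (and cleared) before the error reply is sent to the agent. */
int search_next_error_path(struct search_op *op)
{
    search_op_release_result(op);
    return -1; /* error returned to the caller */
}

/* Hypothetical client discard: frees only what is still owned. */
void discard_connection(struct search_op *op)
{
    if (op->last_result != NULL) {
        search_op_release_result(op);
    }
}
~~~

With the pointer cleared on the error path, a later call to the discard routine finds nothing left to free, which is the behaviour the ticket asks for.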


~~~
2017-04-25T05:30:24.483922-04:00 local0.warning scm2 osafimmnd[2793]: WA 
ERR_NO_RESOURCES: SearchNext: Implementer died during fetch of pure RTA
2017-04-25T05:30:24.670234-04:00 local0.notice scm2 osafamfnd[4402]: NO 
'safSu=SCM2,safSg=NoRed,safApp=OpenSAF' component restart probation timer 
started (timeout: 600 ns)
2017-04-25T05:30:24.670313-04:00 local0.notice scm2 osafamfnd[4402]: NO 
Restarting a component of 'safSu=SCM2,safSg=NoRed,safApp=OpenSAF' (comp restart 
count: 1)
2017-04-25T05:30:24.670514-04:00 local0.notice scm2 osafamfnd[4402]: NO 
'safComp=IMMND,safSu=SCM2,safSg=NoRed,safApp=OpenSAF' faulted due to 'avaDown' 
: Recovery is 'componentRestart'
~~~

~~~
GNU gdb (Wind River Linux Sourcery CodeBench 4.8-28) 7.6
Copyright (C) 2013 Free Software Foundation, Inc.
License GPLv3+: GNU GPL version 3 or later 
This is free software: you are free to change and redistribute it.
There is NO WARRANTY, to the extent permitted by law.  Type "show copying"
and "show warranty" for details.
This GDB was configured as "x86_64-wrs-linux-gnu".
For bug reporting instructions, please see:
...
Reading symbols from /usr/lib64/opensaf/osafimmnd...Reading symbols from 
/usr/lib64/opensaf/.debug/osafimmnd...done.
done.
[New LWP 2793]
[New LWP 2797]
[New LWP 2795]
[New LWP 2796]

warning: Could not load shared library symbols for linux-vdso.so.1.
Do you need "set solib-search-path" or "set sysroot"?
[Thread debugging using libthread_db enabled]
Using host libthread_db library "/lib64/libthread_db.so.1".
Core was generated by `/usr/lib64/opensaf/osafimmnd osafimmnd'.
Program terminated with signal 6, Aborted.
#0  0x003d84a353e9 in __GI_raise (sig=sig@entry=6) at 
../nptl/sysdeps/unix/sysv/linux/raise.c:56
56  ../nptl/sysdeps/unix/sysv/linux/raise.c: No such file or directory.
(gdb) bt
#0  0x003d84a353e9 in __GI_raise (sig=sig@entry=6) at 
../nptl/sysdeps/unix/sysv/linux/raise.c:56
#1  0x003d84a38508 in __GI_abort () at abort.c:89
#2  0x003d84a6e964 in __libc_message (do_abort=do_abort@entry=2, 
fmt=fmt@entry=0x3d84b65f88 "*** Error in `%s': %s: 0x%s ***\n") at 
../sysdeps/posix/libc_fatal.c:175
#3  0x003d84a786be in malloc_printerr (action=3, str=0x3d84b62052 "free(): 
invalid pointer", ptr=<optimized out>) at malloc.c:4895
#4  0x003d84a79397 in _int_free (av=<optimized out>, p=0xf8c0f0, 
have_lock=0) at malloc.c:3751
#5  0x004088af in freeSearchNext (rsp=0xbe5d60, freeTop=SA_TRUE) at 
immnd_evt.c:1378
#6  0x00424602 in immnd_proc_imma_discard_connection (cb=0x6eee60 
<_immnd_cb>, cl_node=0x956b60, scAbsence=false) at immnd_proc.c:108
#7  0x0040a657 in immnd_evt_proc_imm_finalize (cb=0x6eee60 <_immnd_cb>, 
evt=0x7ff2640029c0, sinfo=0x7ff264002b00, isOm=SA_TRUE) at immnd_evt.c:2071
#8  0x0040614c in immnd_process_evt () at immnd_evt.c:535
#9  0x00422e14 in main (argc=2, argv=0x7fff532890f8) at immnd_main.c:370
(gdb)
~~~


---



[tickets] [opensaf:tickets] #2469 clm: Stop tracking api returns NOT_EXIST

2017-05-28 Thread Minh Hon Chau



---

**[tickets:#2469] clm: Stop tracking api returns NOT_EXIST**

**Status:** unassigned
**Milestone:** 5.17.06
**Created:** Mon May 29, 2017 12:19 AM UTC by Minh Hon Chau
**Last Updated:** Mon May 29, 2017 12:19 AM UTC
**Owner:** nobody


When performing a switchover, AMFD fails to stop CLM cluster tracking: the 
stop call returns error code 12 (NOT_EXIST).

**syslog:**
2017-05-26 10:19:02 SC-1 osafamfd[268]: NO Controller switch over initiated
2017-05-26 10:19:02 SC-1 osafamfd[268]: NO ROLE SWITCH Active --> Quiesced
2017-05-26 10:19:02 SC-1 osafimmnd[205]: NO Implementer (applier) connected: 40 
(@OpenSafImmReplicatorB) <343, 2010f>
2017-05-26 10:19:02 SC-1 osafntfimcnd[626]: NO Started
2017-05-26 10:19:12 SC-1 osafamfd[268]: WA Failed to stop cluster tracking 5
2017-05-26 10:19:12 SC-1 osafimmnd[205]: NO Implementer disconnected 32 <27, 
2010f> (safAmfService)
2017-05-26 10:19:12 SC-1 osafimmnd[205]: NO Implementer (applier) connected: 41 
(@safAmfService2010f) <27, 2010f>
2017-05-26 10:19:12 SC-1 osafamfnd[283]: NO AVD NEW_ACTIVE, adest:1
2017-05-26 10:19:12 SC-1 osafimmnd[205]: NO Implementer disconnected 31 <0, 
2020f> (@safAmfService2020f)
2017-05-26 10:19:12 SC-1 osafimmnd[205]: NO Implementer connected: 42 
(safAmfService) <0, 2020f>
2017-05-26 10:19:12 SC-1 osafamfd[268]: NO Switching Quiesced --> StandBy
2017-05-26 10:19:13 SC-1 osafamfd[268]: ER Failed to stop cluster tracking 12
2017-05-26 10:19:13 SC-1 osafamfd[268]: ER Failed to stop cluster tracking 
after switch over
2017-05-26 10:19:13 SC-1 osafamfd[268]: NO Controller switch over done
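
To read the syslog lines above, it helps to map the trailing numbers to names. The sketch below assumes that the number printed after "Failed to stop cluster tracking" is the `SaAisErrorT` return value. The helper `ais_err_name()` is hypothetical and covers only a few values from the SAF AIS error enumeration.

~~~c
#include <stddef.h>

/* Hypothetical helper: name a few SaAisErrorT values (SAF AIS, saAis.h).
 * Only a subset is listed; anything else is reported as "UNKNOWN". */
const char *ais_err_name(int code)
{
    switch (code) {
    case 1:  return "SA_AIS_OK";
    case 5:  return "SA_AIS_ERR_TIMEOUT";
    case 6:  return "SA_AIS_ERR_TRY_AGAIN";
    case 9:  return "SA_AIS_ERR_BAD_HANDLE";
    case 12: return "SA_AIS_ERR_NOT_EXIST";
    default: return "UNKNOWN";
    }
}
~~~

Under that assumption, the warning at 10:19:12 (code 5) would be a timeout, and the error at 10:19:13 (code 12) is the NOT_EXIST result this ticket reports.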

**CLM trace:**
May 26 10:19:13.173369 osafclmd [240:240:src/clm/clmd/clms_evt.c:1347] >> 
proc_track_stop_msg 
May 26 10:19:13.173374 osafclmd [240:240:src/clm/clmd/clms_util.c:0126] >> 
clms_node_get_by_id 
May 26 10:19:13.173379 osafclmd [240:240:src/clm/clmd/clms_util.c:0137] TR Node 
found 131343
May 26 10:19:13.173383 osafclmd [240:240:src/clm/clmd/clms_util.c:0140] << 
clms_node_get_by_id 
May 26 10:19:13.173388 osafclmd [240:240:src/clm/clmd/clms_evt.c:1350] TR Node 
id = 131343
May 26 10:19:13.173393 osafclmd [240:240:src/clm/clmd/clms_mds.c:1553] >> 
clms_mds_msg_send 
May 26 10:19:13.173448 osafclmd [240:240:src/clm/clmd/clms_mds.c:1587] << 
clms_mds_msg_send 
May 26 10:19:13.173457 osafclmd [240:240:src/clm/clmd/clms_mbcsv.c:0810] >> 
clms_send_async_update 
May 26 10:19:13.173462 osafclmd [240:240:src/mbc/mbcsv_api.c:0798] >> 
mbcsv_process_snd_ckpt_request: Sending checkpoint data to all STANDBY peers, 
as per the send-type specified
May 26 10:19:13.173504 osafclmd [240:240:src/mbc/mbcsv_api.c:0830] TR 
svc_id:48, pwe_hdl:65552
May 26 10:19:13.173509 osafclmd [240:240:src/mbc/mbcsv_util.c:0363] >> 
mbcsv_send_ckpt_data_to_all_peers 
May 26 10:19:13.173593 osafclmd [240:240:src/mbc/mbcsv_util.c:0411] TR 
dispatching FSM for NCSMBCSV_SEND_ASYNC_UPDATE
May 26 10:19:13.173599 osafclmd [240:240:src/mbc/mbcsv_act.c:0103] TR ASYNC 
update to be sent. role: 1, svc_id: 48, pwe_hdl: 65552
May 26 10:19:13.173604 osafclmd [240:240:src/mbc/mbcsv_util.c:0424] TR calling 
encode callback
May 26 10:19:13.173610 osafclmd [240:240:src/clm/clmd/clms_mbcsv.c:0740] >> 
mbcsv_callback 
May 26 10:19:13.173615 osafclmd [240:240:src/clm/clmd/clms_mbcsv.c:0856] >> 
ckpt_encode_cbk_handler 
May 26 10:19:13.173626 osafclmd [240:240:src/clm/clmd/clms_mbcsv.c:0867] TR 
cbk_arg->info.encode.io_msg_type type 1
May 26 10:19:13.173632 osafclmd [240:240:src/clm/clmd/clms_mbcsv.c:1307] >> 
ckpt_encode_async_update 
May 26 10:19:13.173637 osafclmd [240:240:src/clm/clmd/clms_mbcsv.c:1324] TR 
data->header.type 3
May 26 10:19:13.173641 osafclmd [240:240:src/clm/clmd/clms_mbcsv.c:1362] TR 
Async update CLMS_CKPT_TRACK_START
May 26 10:19:13.173646 osafclmd [240:240:src/clm/clmd/clms_mbcsv.c:1701] >> 
enc_mbcsv_track_changes_msg 
May 26 10:19:13.173650 osafclmd [240:240:src/clm/clmd/clms_mbcsv.c:1714] << 
enc_mbcsv_track_changes_msg 
May 26 10:19:13.173654 osafclmd [240:240:src/clm/clmd/clms_mbcsv.c:1515] << 
ckpt_encode_async_update 
May 26 10:19:13.173658 osafclmd [240:240:src/clm/clmd/clms_mbcsv.c:0910] << 
ckpt_encode_cbk_handler 
May 26 10:19:13.173663 osafclmd [240:240:src/clm/clmd/clms_mbcsv.c:0780] << 
mbcsv_callback 
May 26 10:19:13.173667 osafclmd [240:240:src/mbc/mbcsv_util.c:0469] TR send the 
encoded message to any other peer with same s/w version
May 26 10:19:13.173671 osafclmd [240:240:src/mbc/mbcsv_util.c:0472] TR 
dispatching FSM for NCSMBCSV_SEND_ASYNC_UPDATE
May 26 10:19:13.173675 osafclmd [240:240:src/mbc/mbcsv_act.c:0103] TR ASYNC 
update to be sent. role: 1, svc_id: 48, pwe_hdl: 65552
May 26 10:19:13.173680 osafclmd [240:240:src/mbc/mbcsv_mds.c:0197] >> 
mbcsv_mds_send_msg: sending to vdest:10
May 26 10:19:13.173684 osafclmd [240:240:src/mbc/mbcsv_mds.c:0218] TR send type 
MDS_SENDTYPE_REDRSP:
May 26 10:19:13.174115 osafclmd [240:243:src/mds/mds_dt_trans.c:0755] >> 
mdtm_process_poll_recv_data_tcp 
May 26 10:19:13.174192 osafclmd [240:240:src/mbc/mbcsv_mds.c:0252] <<