logs attached again. 
steps to reproduce. 
1) Brinup up 2n model
2) Make sure components which receive active cbk report error on self with 
recovery=componentFailover
3) lock/unlock active Su.
Due to continuous faults on all nodes, a crash is observed on payload and 
cluster as expected will go for reboot. 

This time, also observed amfnd crash on pl-3. 
Program terminated with signal 6, Aborted.
#0  0x00007f8d74463b55 in raise () from /lib64/libc.so.6
(gdb) bt
#0  0x00007f8d74463b55 in raise () from /lib64/libc.so.6
#1  0x00007f8d74465131 in abort () from /lib64/libc.so.6
#2  0x0000000000405466 in defaultImmutilError (fmt=0x454c98 "saImmOmInitialize 
FAILED, rc = %d")
    at ../../../../../osaf/tools/safimm/src/immutil.c:72
#3  0x0000000000407e97 in immutil_saImmOmInitialize (immHandle=0x7fff11e61908, 
immCallbacks=0x0, 
    version=0x7fff11e61900) at 
../../../../../osaf/tools/safimm/src/immutil.c:1127
#4  0x000000000042aab8 in avnd_compdb_init (cb=0x66e000) at avnd_compdb.c:183
#5  0x00000000004096ac in avnd_cb_create () at avnd.c:270
#6  0x00000000004093c6 in avnd_create () at avnd.c:126
#7  0x000000000043f86e in avnd_main_process () at avnd_proc.c:167
#8  0x000000000040916e in main (argc=2, argv=0x7fff11e61ad8) at amfnd_main.c:53
(gdb) bt full
#0  0x00007f8d74463b55 in raise () from /lib64/libc.so.6
No symbol table info available.
#1  0x00007f8d74465131 in abort () from /lib64/libc.so.6
No symbol table info available.
#2  0x0000000000405466 in defaultImmutilError (fmt=0x454c98 "saImmOmInitialize 
FAILED, rc = %d")
    at ../../../../../osaf/tools/safimm/src/immutil.c:72
        ap = {{gp_offset = 16, fp_offset = 48, overflow_arg_area = 
0x7fff11e618b0, 
            reg_save_area = 0x7fff11e617f0}}
        ap2 = {{gp_offset = 16, fp_offset = 48, overflow_arg_area = 
0x7fff11e618b0, 
            reg_save_area = 0x7fff11e617f0}}
#3  0x0000000000407e97 in immutil_saImmOmInitialize (immHandle=0x7fff11e61908, 
immCallbacks=0x0, 
    version=0x7fff11e61900) at 
../../../../../osaf/tools/safimm/src/immutil.c:1127
        localVer = {releaseCode = 65 'A', majorVersion = 2 '\002', minorVersion 
= 12 '\f'}
        rc = SA_AIS_ERR_TRY_AGAIN
        nTries = 25
#4  0x000000000042aab8 in avnd_compdb_init (cb=0x66e000) at avnd_compdb.c:183
        params = {key_size = 0, info_size = 0, actual_key_size = 0, node_size = 
0}
        rc = 1
        immOmHandle = 6742016
        immVersion = {releaseCode = 65 'A', majorVersion = 2 '\002', 
minorVersion = 1 '\001'}
        __FUNCTION__ = "avnd_compdb_init"
#5  0x00000000004096ac in avnd_cb_create () at avnd.c:270
        cb = 0x66e000
        val = 0x0
        __FUNCTION__ = "avnd_cb_create"
#6  0x00000000004093c6 in avnd_create () at avnd.c:126
        cb = 0x0
        rc = 1
---Type <return> to continue, or q <return> to quit---
        __FUNCTION__ = "avnd_create"
#7  0x000000000043f86e in avnd_main_process () at avnd_proc.c:167
        mbx_fd = {raise_obj = 300292560, rmv_obj = 32767}
        fds = {{fd = 1954156160, events = 32653, revents = 0}, {fd = 0, events 
= 1, revents = 0}, {fd = 0, 
            events = 0, revents = 0}, {fd = 1972322728, events = 1, revents = 
0}}
        nfds = 3
        evt = 0x2756e2640
        __FUNCTION__ = "avnd_main_process"
#8  0x000000000040916e in main (argc=2, argv=0x7fff11e61ad8) at amfnd_main.c:53
        error = 0
(gdb) 


Attachment: logs_new.tgz (13.8 MB; application/x-compressed-tar) 


---

** [tickets:#855] immnd crash on payload node**

**Status:** assigned
**Milestone:** 4.3.3
**Created:** Tue Apr 15, 2014 10:11 AM UTC by surender khetavath
**Last Updated:** Wed Apr 16, 2014 11:32 AM UTC
**Owner:** Anders Bjornerstedt

case:
1) A component calls exit() when active_cbk is received. 

All the components on all the nodes, due to continuous faults received 
active-cbk and called exit() within the comp and cluster went for reboot. That 
is expected. But immnd crashed on PL-4 and PL-5

/var/log/messages on PL-4 show:

Apr 15 15:12:01 PL-4 osafimmnd[7650]: ER Sync-verify: Established node has 
different Implementer-id: 0 for name: @COMP2SU1TWONAPP, sync says 109.
Apr 15 15:12:01 PL-4 osafamfnd[7668]: NO 
'safComp=IMMND,safSu=PL-4,safSg=NoRed,safApp=OpenSAF' faulted due to 'avaDown' 
: Recovery is 'componentRestart'


(gdb) fr 2
#2  0x000000000043b7ec in ImmModel::finalizeSync (this=0x6baa00, 
req=0x7fffcb25d160, isCoord=false, 
    isSyncClient=false) at ImmModel.cc:14482
14482                           abort();
(gdb) l
14477   
14478                       if(!explained) {
14479                           LOG_ER("Sync-verify: Established node has 
different "
14480                                  "Implementer-id: %u for name: %s, sync 
says %u. ",
14481                                  info->mId, implName.c_str(), ii->id);
14482                           abort();
14483                       }
14484   
14485                   } else if(info->mNodeId != ii->nodeId) {
14486                       LOG_ER("Sync-verify: Missmatch on node-id "
(gdb) p explained
$1 = false
(gdb) q

logs attached and gdb output attached. 


---

Sent from sourceforge.net because [email protected] is 
subscribed to https://sourceforge.net/p/opensaf/tickets/

To unsubscribe from further messages, a project admin can change settings at 
https://sourceforge.net/p/opensaf/admin/tickets/options.  Or, if this is a 
mailing list, you can unsubscribe from the mailing list.
------------------------------------------------------------------------------
Start Your Social Network Today - Download eXo Platform
Build your Enterprise Intranet with eXo Platform Software
Java Based Open Source Intranet - Social, Extensible, Cloud Ready
Get Started Now And Turn Your Intranet Into A Collaboration Platform
http://p.sf.net/sfu/ExoPlatform
_______________________________________________
Opensaf-tickets mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/opensaf-tickets

Reply via email to