in the log above it seems nid is finished and
Feb 7 11:41:24 SC-2 osafamfd[5422]: exiting for shutdown
and causes a node reboot. The core dump seems produced from nodeinit.
Run gdb command thread apply all bt full on the core dump,
but I guess this fault happens during nodeinit exit and this ticket is a 
duplicate of #2294


---

** [tickets:#2289] opensafd (nid): coredump while standby starting**

**Status:** unassigned
**Milestone:** 5.2.RC1
**Created:** Tue Feb 07, 2017 06:31 AM UTC by A V Mahesh (AVM)
**Last Updated:** Thu Feb 23, 2017 12:31 PM UTC
**Owner:** nobody


Restart Standby with TCP , opensafd core dumping

========================================================================
(gdb) bt
/#0  0x00007f2f05cb0b55 in raise () from /lib64/libc.so.6
/#1  0x00007f2f05cb2131 in abort () from /lib64/libc.so.6
/#2  0x00007f2f06704955 in __gnu_cxx::__verbose_terminate_handler() () at 
../../../../gcc-4.8.3/libstdc++-v3/libsupc++/vterminate.cc:95
/#3  0x00007f2f06702af6 in __cxxabiv1::__terminate(void (*)()) () at 
../../../../gcc-4.8.3/libstdc++-v3/libsupc++/eh_terminate.cc:38
/#4  0x00007f2f06702b23 in std::terminate() () at 
../../../../gcc-4.8.3/libstdc++-v3/libsupc++/eh_terminate.cc:48
/#5  0x00007f2f06702d42 in __cxa_throw () at 
../../../../gcc-4.8.3/libstdc++-v3/libsupc++/eh_throw.cc:87
/#6  0x00007f2f0670322d in operator new(unsigned long) () at 
../../../../gcc-4.8.3/libstdc++-v3/libsupc++/new_op.cc:56
/#7  0x00007f2f06761979 in std::string::_Rep::_S_create(unsigned long, unsigned 
long, std::allocator<char> const&) ()
    at 
/home/build/x86_64-unknown-linux-gnu/libstdc++-v3/include/ext/new_allocator.h:104
#8  0x00007f2f0676256b in std::string::_Rep::_M_clone(std::allocator<char> 
const&, unsigned long) () at 
/home/build/x86_64-unknown-linux-gnu/libstdc++-v3/include/bits/basic_string.tcc:629
#9  0x00007f2f06762bec in std::basic_string<char, std::char_traits<char>, 
std::allocator<char> >::basic_string(std::string const&) ()
    at 
/home/build/x86_64-unknown-linux-gnu/libstdc++-v3/include/bits/basic_string.h:229
#10 0x00007f2f07262c39 in handle_data_request(pollfd*, std::string const&) () 
at /usr/include/c++/4.8.3/bits/basic_string.h:2405
#11 0x00007f2f0726320f in svc_monitor_thread(void*) () at 
src/nid/nodeinit.cc:1539
#12 0x00007f2f05ff97b6 in start_thread () from /lib64/libpthread.so.0
#13 0x00007f2f05d559cd in clone () from /lib64/libc.so.6
#14 0x0000000000000000 in ?? ()
(gdb) q

========================================================================

Feb  7 11:41:13 SC-2 opensafd: OpenSAF services successfully stopped
Feb  7 11:41:21 SC-2 opensafd: Starting OpenSAF Services(5.1.M0 - ) (Using TCP)
Feb  7 11:41:21 SC-2 osafdtmd[5329]: mkfifo already exists: 
/var/lib/opensaf/osafdtmd.fifo File exists
Feb  7 11:41:21 SC-2 osafdtmd[5329]: Started
Feb  7 11:41:21 SC-2 osaftransportd[5336]: Started
Feb  7 11:41:21 SC-2 osafclmna[5343]: Started
Feb  7 11:41:21 SC-2 osafrded[5352]: Started
Feb  7 11:41:22 SC-2 osaffmd[5361]: Started
Feb  7 11:41:22 SC-2 osaffmd[5361]: NO Remote fencing is disabled
Feb  7 11:41:22 SC-2 osafimmd[5371]: Started
Feb  7 11:41:22 SC-2 osafimmd[5371]: NO ******* SC_ABSENCE_ALLOWED (Headless 
Hydra) is configured: 900 ***********
Feb  7 11:41:22 SC-2 osafimmnd[5382]: Started
Feb  7 11:41:22 SC-2 osafimmnd[5382]: NO Persistent Back-End capability 
configured, Pbe file:imm.db (suffix may get added)
Feb  7 11:41:22 SC-2 opensafd[5318]: NO Monitoring of TRANSPORT started
Feb  7 11:41:22 SC-2 osafclmna[5343]: NO Starting to promote this node to a 
system controller
Feb  7 11:41:22 SC-2 osafrded[5352]: NO Requesting ACTIVE role
Feb  7 11:41:22 SC-2 osafrded[5352]: NO RDE role set to Undefined
Feb  7 11:41:22 SC-2 osafdtmd[5329]: NO Established contact with 'PL-3'
Feb  7 11:41:22 SC-2 osafdtmd[5329]: NO Established contact with 'SC-1'
Feb  7 11:41:22 SC-2 osafdtmd[5329]: NO Established contact with 'PL-4'
Feb  7 11:41:22 SC-2 osafrded[5352]: NO Peer up on node 0x2010f
Feb  7 11:41:22 SC-2 osafimmnd[5382]: NO IMMD service is UP ... 
ScAbsenseAllowed?:0 introduced?:0
Feb  7 11:41:22 SC-2 osafrded[5352]: NO Got peer info request from node 0x2010f 
with role ACTIVE
Feb  7 11:41:22 SC-2 osafrded[5352]: NO Got peer info response from node 
0x2010f with role ACTIVE
Feb  7 11:41:22 SC-2 osafrded[5352]: NO RDE role set to QUIESCED
Feb  7 11:41:22 SC-2 osafrded[5352]: NO Giving up election against 0x2010f with 
role ACTIVE. My role is now QUIESCED
Feb  7 11:41:22 SC-2 osafclmna[5343]: NO safNode=SC-2,safCluster=myClmCluster 
Joined cluster, nodeid=2020f
Feb  7 11:41:22 SC-2 osafimmnd[5382]: NO Fevs count adjusted to 2835 
preLoadPid: 0
Feb  7 11:41:22 SC-2 osafimmnd[5382]: NO SERVER STATE: IMM_SERVER_ANONYMOUS --> 
IMM_SERVER_CLUSTER_WAITING
Feb  7 11:41:22 SC-2 osafimmnd[5382]: NO SERVER STATE: 
IMM_SERVER_CLUSTER_WAITING --> IMM_SERVER_LOADING_PENDING
Feb  7 11:41:22 SC-2 osafimmnd[5382]: NO SERVER STATE: 
IMM_SERVER_LOADING_PENDING --> IMM_SERVER_SYNC_PENDING
Feb  7 11:41:22 SC-2 osafimmnd[5382]: NO NODE STATE-> IMM_NODE_ISOLATED
Feb  7 11:41:23 SC-2 osafimmnd[5382]: NO NODE STATE-> IMM_NODE_W_AVAILABLE
Feb  7 11:41:23 SC-2 osafimmnd[5382]: NO SERVER STATE: IMM_SERVER_SYNC_PENDING 
--> IMM_SERVER_SYNC_CLIENT
Feb  7 11:41:23 SC-2 osafimmnd[5382]: NO NODE STATE-> IMM_NODE_FULLY_AVAILABLE 
2926
Feb  7 11:41:23 SC-2 osafimmnd[5382]: NO RepositoryInitModeT is 
SA_IMM_INIT_FROM_FILE
Feb  7 11:41:23 SC-2 osafimmnd[5382]: WA IMM Access Control mode is DISABLED!
Feb  7 11:41:23 SC-2 osafimmnd[5382]: NO Epoch set to 8 in ImmModel
Feb  7 11:41:23 SC-2 opensafd[5318]: NO Monitoring of IMMND started
Feb  7 11:41:23 SC-2 osafimmnd[5382]: NO SERVER STATE: IMM_SERVER_SYNC_CLIENT 
--> IMM_SERVER_READY
Feb  7 11:41:23 SC-2 osafimmnd[5382]: NO ImmModel received scAbsenceAllowed 900
Feb  7 11:41:23 SC-2 osaflogd[5392]: Started
Feb  7 11:41:23 SC-2 opensafd[5318]: NO Monitoring of LOGD started
Feb  7 11:41:23 SC-2 osafntfd[5402]: Started
Feb  7 11:41:23 SC-2 opensafd[5318]: NO Monitoring of NTFD started
Feb  7 11:41:23 SC-2 osafclmd[5412]: Started
Feb  7 11:41:23 SC-2 opensafd[5318]: NO Monitoring of CLMD started
Feb  7 11:41:23 SC-2 osafamfd[5422]: Started
Feb  7 11:41:23 SC-2 opensafd[5318]: NO Monitoring of AMFD started
Feb  7 11:41:24 SC-2 osafamfnd[5432]: mkfifo already exists: 
/var/lib/opensaf/osafamfnd.fifo File exists
Feb  7 11:41:24 SC-2 osafamfnd[5432]: Started
Feb  7 11:41:24 SC-2 osafamfnd[5432]: NO Start monitoring AMFD using 
/var/lib/opensaf/osafamfd.fifo
Feb  7 11:41:24 SC-2 osafamfnd[5432]: NO Sending node up due to NCSMDS_UP
Feb  7 11:41:24 SC-2 osafamfnd[5432]: NO 'safSu=SC-2,safSg=2N,safApp=OpenSAF' 
Presence State UNINSTANTIATED => INSTANTIATING
Feb  7 11:41:24 SC-2 osafamfnd[5432]: NO 
'safSu=SC-2,safSg=NoRed,safApp=OpenSAF' Presence State UNINSTANTIATED => 
INSTANTIATING
Feb  7 11:41:24 SC-2 osafamfwd[5448]: mkfifo already exists: 
/var/lib/opensaf/osafamfwd.fifo File exists
Feb  7 11:41:24 SC-2 osafamfwd[5448]: Started
Feb  7 11:41:24 SC-2 osafckptd[5453]: Started
Feb  7 11:41:24 SC-2 osafckptnd[5467]: Started
Feb  7 11:41:24 SC-2 osafevtd[5469]: Started
Feb  7 11:41:24 SC-2 osaflcknd[5487]: Started
Feb  7 11:41:24 SC-2 osaflckd[5502]: Started
Feb  7 11:41:24 SC-2 osafmsgnd[5520]: Started
Feb  7 11:41:24 SC-2 osafimmnd[5382]: NO Implementer connected: 25 
(MsgQueueService131599) <124, 2020f>
Feb  7 11:41:24 SC-2 osafsmfnd[5538]: Started
Feb  7 11:41:24 SC-2 osafmsgd[5551]: Started
Feb  7 11:41:24 SC-2 osafamfnd[5432]: NO 
'safSu=SC-2,safSg=NoRed,safApp=OpenSAF' Presence State INSTANTIATING => 
INSTANTIATED
Feb  7 11:41:24 SC-2 osafamfnd[5432]: NO Assigning 
'safSi=NoRed2,safApp=OpenSAF' ACTIVE to 'safSu=SC-2,safSg=NoRed,safApp=OpenSAF'
Feb  7 11:41:24 SC-2 osafamfnd[5432]: NO Assigned 'safSi=NoRed2,safApp=OpenSAF' 
ACTIVE to 'safSu=SC-2,safSg=NoRed,safApp=OpenSAF'
Feb  7 11:41:24 SC-2 osafsmfd[5569]: Started
Feb  7 11:41:24 SC-2 osafamfnd[5432]: NO 'safSu=SC-2,safSg=2N,safApp=OpenSAF' 
Presence State INSTANTIATING => INSTANTIATED
Feb  7 11:41:24 SC-2 osafamfnd[5432]: NO Assigning 'safSi=SC-2N,safApp=OpenSAF' 
STANDBY to 'safSu=SC-2,safSg=2N,safApp=OpenSAF'
Feb  7 11:41:24 SC-2 osafrded[5352]: NO RDE role set to STANDBY
Feb  7 11:41:24 SC-2 osafrded[5352]: NO Peer up on node 0x2010f
Feb  7 11:41:24 SC-2 osafrded[5352]: NO Got peer info request from node 0x2010f 
with role ACTIVE
Feb  7 11:41:24 SC-2 osafrded[5352]: NO Got peer info response from node 
0x2010f with role ACTIVE
Feb  7 11:41:24 SC-2 osafimmd[5371]: NO MDS event from svc_id 24 (change:5, 
dest:13)
Feb  7 11:41:24 SC-2 osafimmd[5371]: NO MDS event from svc_id 24 (change:3, 
dest:13)
Feb  7 11:41:24 SC-2 osafimmd[5371]: NO MDS event from svc_id 24 (change:5, 
dest:13)
Feb  7 11:41:24 SC-2 osafimmd[5371]: NO MDS event from svc_id 25 (change:3, 
dest:567412424446133)
Feb  7 11:41:24 SC-2 osafimmd[5371]: NO MDS event from svc_id 25 (change:3, 
dest:564113889574456)
Feb  7 11:41:24 SC-2 osafimmd[5371]: NO MDS event from svc_id 25 (change:3, 
dest:566312912818470)
Feb  7 11:41:24 SC-2 osafimmd[5371]: NO MDS event from svc_id 25 (change:3, 
dest:565213401191686)
Feb  7 11:41:24 SC-2 osafimmnd[5382]: NO Implementer (applier) connected: 26 
(@safAmfService2020f) <127, 2020f>
Feb  7 11:41:24 SC-2 osaflogd[5392]: NO LOGSV_DATA_GROUPNAME not found
Feb  7 11:41:24 SC-2 osaflogd[5392]: NO LOG root directory is: 
"/var/log/opensaf/saflog"
Feb  7 11:41:24 SC-2 osaflogd[5392]: NO LOG data group is: ""
Feb  7 11:41:24 SC-2 osaflogd[5392]: NO LGS_MBCSV_VERSION = 5
Feb  7 11:41:24 SC-2 osafamfnd[5432]: NO Assigned 'safSi=SC-2N,safApp=OpenSAF' 
STANDBY to 'safSu=SC-2,safSg=2N,safApp=OpenSAF'
Feb  7 11:41:24 SC-2 osafamfd[5422]: exiting for shutdown
Feb  7 11:41:24 SC-2 osafamfnd[5432]: ER AMFD has unexpectedly crashed. 
Rebooting node
Feb  7 11:41:24 SC-2 osafamfnd[5432]: Rebooting OpenSAF NodeId = 131599 EE Name 
= , Reason: AMFD has unexpectedly crashed. Rebooting node, OwnNodeId = 131599, 
SupervisionTime = 60
Feb  7 11:41:24 SC-2 osafimmnd[5382]: NO Implementer locally disconnected. 
Marking it as doomed 26 <127, 2020f> (@safAmfService2020f)
Feb  7 11:41:24 SC-2 osafimmnd[5382]: NO Implementer disconnected 26 <127, 
2020f> (@safAmfService2020f)
Feb  7 11:41:24 SC-2 opensaf_reboot: Rebooting local node; timeout=60


---

Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is 
subscribed to https://sourceforge.net/p/opensaf/tickets/

To unsubscribe from further messages, a project admin can change settings at 
https://sourceforge.net/p/opensaf/admin/tickets/options.  Or, if this is a 
mailing list, you can unsubscribe from the mailing list.
------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, SlashDot.org! http://sdm.link/slashdot
_______________________________________________
Opensaf-tickets mailing list
Opensaf-tickets@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/opensaf-tickets

Reply via email to