- **Milestone**: 5.0.RC2 --> 5.0.1


---

** [tickets:#1784] Amfd asserts on clm locked controller after successfully 
taking active role as a part of  failover**

**Status:** unassigned
**Milestone:** 5.0.1
**Created:** Tue Apr 26, 2016 11:47 AM UTC by Ritu Raj
**Last Updated:** Thu Apr 28, 2016 10:11 AM UTC
**Owner:** nobody
**Attachments:**

- 
[messages](https://sourceforge.net/p/opensaf/tickets/1784/attachment/messages) 
(3.2 MB; application/octet-stream)
- 
[osafamfd](https://sourceforge.net/p/opensaf/tickets/1784/attachment/osafamfd) 
(7.4 MB; application/octet-stream)


setup:
Changeset- 7436
Version - opensaf 5.0 FC

 * Issue Observed :
Amfd asserts on clm locked controller after successfully taking active role as 
a part of  failover.  


* Steps To Reproduce:
1. OpenSAF running on 4 nodes, where SC-1 is Active , SC-2 Standby and PL-3 and 
PL-4 are payloads.
2. Performed CLM lock of stanby controller (SC-2),
3. Now, perform failover on active controller(SC-1)
4. Observed that amfd asserted on clm locked controller(SC-2) and cluster reset 
happened

>SLOT-2:~ # Apr 26 14:56:06 SLOT-2 osafimmd[2199]: WA IMMD lost contact with 
>peer IMMD (NCSMDS_RED_DOWN)
.......
Apr 26 14:56:11 SLOT-2 osaffmd[2189]: NO Node Down event for node id 2010f:
Apr 26 14:56:11 SLOT-2 osaffmd[2189]: NO Current role: STANDBY
.......
Apr 26 14:56:11 SLOT-2 osafrded[2180]: NO Peer down on node 0x2010f
Apr 26 14:56:11 SLOT-2 osafimmd[2199]: WA IMMND DOWN on active controller 1 
detected at standby immd!! 2. Possible failover
.......
Apr 26 14:56:11 SLOT-2 opensaf_reboot: Rebooting remote node in the absence of 
PLM is outside the scope of OpenSAF
Apr 26 14:56:11 SLOT-2 osaffmd[2189]: NO Controller Failover: Setting role to 
ACTIVE
Apr 26 14:56:11 SLOT-2 osafrded[2180]: NO RDE role set to ACTIVE
Apr 26 14:56:11 SLOT-2 osafrded[2180]: NO Running 
'/usr/lib64/opensaf/opensaf_sc_active' with 0 argument(s)
Apr 26 14:56:11 SLOT-2 osafimmd[2199]: NO ACTIVE request
Apr 26 14:56:11 SLOT-2 osaflogd[2224]: NO ACTIVE request
Apr 26 14:56:11 SLOT-2 osafntfd[2234]: NO ACTIVE request
Apr 26 14:56:11 SLOT-2 osafclmd[2244]: NO ACTIVE request
Apr 26 14:56:11 SLOT-2 osafamfd[2254]: NO FAILOVER StandBy --> Active
Apr 26 14:56:11 SLOT-2 osafamfnd[2264]: NO AVD NEW_ACTIVE, adest:1
Apr 26 14:56:11 SLOT-2 osafimmd[2199]: NO ellect_coord invoke from rda_callback 
ACTIVE
Apr 26 14:56:11 SLOT-2 osafimmd[2199]: NO New coord elected, resides at 2020f
Apr 26 14:56:11 SLOT-2 osafimmnd[2210]: NO 2PBE configured, 
IMMSV_PBE_FILE_SUFFIX:.2020f (sync)
Apr 26 14:56:11 SLOT-2 osafimmnd[2210]: NO This IMMND is now the NEW Coord
Apr 26 14:56:11 SLOT-2 osafimmnd[2210]: NO SETTING COORD TO 1 CLOUD PROTO
Apr 26 14:56:11 SLOT-2 osafimmnd[2210]: NO Implementer disconnected 16 <139, 
2020f> (@safAmfService2020f)
Apr 26 14:56:11 SLOT-2 osafimmnd[2210]: NO Implementer connected: 18 
(safLogService) <126, 2020f>
Apr 26 14:56:11 SLOT-2 osafimmnd[2210]: NO Implementer connected: 19 
(safAmfService) <139, 2020f>
Apr 26 14:56:11 SLOT-2 osafamfd[2254]: NO Node 'SC-1' left the cluster
Apr 26 14:56:11 SLOT-2 osafamfd[2254]: NO FAILOVER StandBy --> Active DONE!
Apr 26 14:56:11 SLOT-2 osafamfnd[2264]: NO Assigning 
'safSi=SC-2N,safApp=OpenSAF' ACTIVE to 'safSu=SC-2,safSg=2N,safApp=OpenSAF'
Apr 26 14:56:11 SLOT-2 osafntfimcnd[2419]: NO exiting on signal 15
......
Apr 26 14:56:11 SLOT-2 osafimmnd[2210]: NO Implementer connected: 27 
(safSmfService) <337, 2020f>
Apr 26 14:56:11 SLOT-2 osafamfnd[2264]: NO Assigned 
'safSi=SC-2N,safApp=OpenSAF' ACTIVE to 'safSu=SC-2,safSg=2N,safApp=OpenSAF'
Apr 26 14:56:11 SLOT-2 osafamfd[2254]: ER Wrong rootCauseEntity �H�
Apr 26 14:56:11 SLOT-2 osafamfd[2254]: clm.cc:312: clm_track_cb: Assertion '0' 
failed.
Apr 26 14:56:11 SLOT-2 osafamfnd[2264]: WA AMF director unexpectedly crashed
Apr 26 14:56:11 SLOT-2 osafamfnd[2264]: Rebooting OpenSAF NodeId = 131599 EE 
Name = , Reason: local AVD down(Adest) or both AVD down(Vdest) received, 
OwnNodeId = 131599, SupervisionTime = 60
Apr 26 14:56:11 SLOT-2 opensaf_reboot: Rebooting local node; timeout=60


* Syslog and amfd trace attached
 Note: The issue is observed randomly


---

Sent from sourceforge.net because [email protected] is 
subscribed to https://sourceforge.net/p/opensaf/tickets/

To unsubscribe from further messages, a project admin can change settings at 
https://sourceforge.net/p/opensaf/admin/tickets/options.  Or, if this is a 
mailing list, you can unsubscribe from the mailing list.
------------------------------------------------------------------------------
Find and fix application performance issues faster with Applications Manager
Applications Manager provides deep performance insights into multiple tiers of
your business applications. It resolves application problems quickly and
reduces your MTTR. Get your free trial!
https://ad.doubleclick.net/ddm/clk/302982198;130105516;z
_______________________________________________
Opensaf-tickets mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/opensaf-tickets

Reply via email to