- **status**: unassigned --> wontfix
- **assigned_to**: Nagendra Kumar
- **Comment**:

Once https://sourceforge.net/p/opensaf/tickets/1342/ is fixed, this #1784 will 
get resolved.



---

** [tickets:#1784] Amfd asserts on clm locked controller after successfully 
taking active role as a part of  failover**

**Status:** wontfix
**Milestone:** 5.0.1
**Created:** Tue Apr 26, 2016 11:47 AM UTC by Ritu Raj
**Last Updated:** Wed May 04, 2016 04:13 PM UTC
**Owner:** Nagendra Kumar
**Attachments:**

- 
[messages](https://sourceforge.net/p/opensaf/tickets/1784/attachment/messages) 
(3.2 MB; application/octet-stream)
- 
[osafamfd](https://sourceforge.net/p/opensaf/tickets/1784/attachment/osafamfd) 
(7.4 MB; application/octet-stream)


setup:
Changeset- 7436
Version - opensaf 5.0 FC

 * Issue Observed :
Amfd asserts on clm locked controller after successfully taking active role as 
a part of  failover.  


* Steps To Reproduce:
1. OpenSAF running on 4 nodes, where SC-1 is Active , SC-2 Standby and PL-3 and 
PL-4 are payloads.
2. Performed CLM lock of stanby controller (SC-2),
3. Now, perform failover on active controller(SC-1)
4. Observed that amfd asserted on clm locked controller(SC-2) and cluster reset 
happened

>SLOT-2:~ # Apr 26 14:56:06 SLOT-2 osafimmd[2199]: WA IMMD lost contact with 
>peer IMMD (NCSMDS_RED_DOWN)
.......
Apr 26 14:56:11 SLOT-2 osaffmd[2189]: NO Node Down event for node id 2010f:
Apr 26 14:56:11 SLOT-2 osaffmd[2189]: NO Current role: STANDBY
.......
Apr 26 14:56:11 SLOT-2 osafrded[2180]: NO Peer down on node 0x2010f
Apr 26 14:56:11 SLOT-2 osafimmd[2199]: WA IMMND DOWN on active controller 1 
detected at standby immd!! 2. Possible failover
.......
Apr 26 14:56:11 SLOT-2 opensaf_reboot: Rebooting remote node in the absence of 
PLM is outside the scope of OpenSAF
Apr 26 14:56:11 SLOT-2 osaffmd[2189]: NO Controller Failover: Setting role to 
ACTIVE
Apr 26 14:56:11 SLOT-2 osafrded[2180]: NO RDE role set to ACTIVE
Apr 26 14:56:11 SLOT-2 osafrded[2180]: NO Running 
'/usr/lib64/opensaf/opensaf_sc_active' with 0 argument(s)
Apr 26 14:56:11 SLOT-2 osafimmd[2199]: NO ACTIVE request
Apr 26 14:56:11 SLOT-2 osaflogd[2224]: NO ACTIVE request
Apr 26 14:56:11 SLOT-2 osafntfd[2234]: NO ACTIVE request
Apr 26 14:56:11 SLOT-2 osafclmd[2244]: NO ACTIVE request
Apr 26 14:56:11 SLOT-2 osafamfd[2254]: NO FAILOVER StandBy --> Active
Apr 26 14:56:11 SLOT-2 osafamfnd[2264]: NO AVD NEW_ACTIVE, adest:1
Apr 26 14:56:11 SLOT-2 osafimmd[2199]: NO ellect_coord invoke from rda_callback 
ACTIVE
Apr 26 14:56:11 SLOT-2 osafimmd[2199]: NO New coord elected, resides at 2020f
Apr 26 14:56:11 SLOT-2 osafimmnd[2210]: NO 2PBE configured, 
IMMSV_PBE_FILE_SUFFIX:.2020f (sync)
Apr 26 14:56:11 SLOT-2 osafimmnd[2210]: NO This IMMND is now the NEW Coord
Apr 26 14:56:11 SLOT-2 osafimmnd[2210]: NO SETTING COORD TO 1 CLOUD PROTO
Apr 26 14:56:11 SLOT-2 osafimmnd[2210]: NO Implementer disconnected 16 <139, 
2020f> (@safAmfService2020f)
Apr 26 14:56:11 SLOT-2 osafimmnd[2210]: NO Implementer connected: 18 
(safLogService) <126, 2020f>
Apr 26 14:56:11 SLOT-2 osafimmnd[2210]: NO Implementer connected: 19 
(safAmfService) <139, 2020f>
Apr 26 14:56:11 SLOT-2 osafamfd[2254]: NO Node 'SC-1' left the cluster
Apr 26 14:56:11 SLOT-2 osafamfd[2254]: NO FAILOVER StandBy --> Active DONE!
Apr 26 14:56:11 SLOT-2 osafamfnd[2264]: NO Assigning 
'safSi=SC-2N,safApp=OpenSAF' ACTIVE to 'safSu=SC-2,safSg=2N,safApp=OpenSAF'
Apr 26 14:56:11 SLOT-2 osafntfimcnd[2419]: NO exiting on signal 15
......
Apr 26 14:56:11 SLOT-2 osafimmnd[2210]: NO Implementer connected: 27 
(safSmfService) <337, 2020f>
Apr 26 14:56:11 SLOT-2 osafamfnd[2264]: NO Assigned 
'safSi=SC-2N,safApp=OpenSAF' ACTIVE to 'safSu=SC-2,safSg=2N,safApp=OpenSAF'
Apr 26 14:56:11 SLOT-2 osafamfd[2254]: ER Wrong rootCauseEntity �H�
Apr 26 14:56:11 SLOT-2 osafamfd[2254]: clm.cc:312: clm_track_cb: Assertion '0' 
failed.
Apr 26 14:56:11 SLOT-2 osafamfnd[2264]: WA AMF director unexpectedly crashed
Apr 26 14:56:11 SLOT-2 osafamfnd[2264]: Rebooting OpenSAF NodeId = 131599 EE 
Name = , Reason: local AVD down(Adest) or both AVD down(Vdest) received, 
OwnNodeId = 131599, SupervisionTime = 60
Apr 26 14:56:11 SLOT-2 opensaf_reboot: Rebooting local node; timeout=60


* Syslog and amfd trace attached
 Note: The issue is observed randomly


---

Sent from sourceforge.net because [email protected] is 
subscribed to https://sourceforge.net/p/opensaf/tickets/

To unsubscribe from further messages, a project admin can change settings at 
https://sourceforge.net/p/opensaf/admin/tickets/options.  Or, if this is a 
mailing list, you can unsubscribe from the mailing list.
------------------------------------------------------------------------------
What NetFlow Analyzer can do for you? Monitors network bandwidth and traffic
patterns at an interface-level. Reveals which users, apps, and protocols are 
consuming the most bandwidth. Provides multi-vendor support for NetFlow, 
J-Flow, sFlow and other flows. Make informed decisions using capacity 
planning reports. https://ad.doubleclick.net/ddm/clk/305295220;132659582;e
_______________________________________________
Opensaf-tickets mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/opensaf-tickets

Reply via email to