- **status**: unassigned --> wontfix
- **assigned_to**: Nagendra Kumar
- **Comment**:
Once https://sourceforge.net/p/opensaf/tickets/1342/ is fixed, this #1784 will
get resolved.
---
** [tickets:#1784] Amfd asserts on clm locked controller after successfully
taking active role as a part of failover**
**Status:** wontfix
**Milestone:** 5.0.1
**Created:** Tue Apr 26, 2016 11:47 AM UTC by Ritu Raj
**Last Updated:** Wed May 04, 2016 04:13 PM UTC
**Owner:** Nagendra Kumar
**Attachments:**
-
[messages](https://sourceforge.net/p/opensaf/tickets/1784/attachment/messages)
(3.2 MB; application/octet-stream)
-
[osafamfd](https://sourceforge.net/p/opensaf/tickets/1784/attachment/osafamfd)
(7.4 MB; application/octet-stream)
setup:
Changeset- 7436
Version - opensaf 5.0 FC
* Issue Observed :
Amfd asserts on clm locked controller after successfully taking active role as
a part of failover.
* Steps To Reproduce:
1. OpenSAF running on 4 nodes, where SC-1 is Active , SC-2 Standby and PL-3 and
PL-4 are payloads.
2. Performed CLM lock of stanby controller (SC-2),
3. Now, perform failover on active controller(SC-1)
4. Observed that amfd asserted on clm locked controller(SC-2) and cluster reset
happened
>SLOT-2:~ # Apr 26 14:56:06 SLOT-2 osafimmd[2199]: WA IMMD lost contact with
>peer IMMD (NCSMDS_RED_DOWN)
.......
Apr 26 14:56:11 SLOT-2 osaffmd[2189]: NO Node Down event for node id 2010f:
Apr 26 14:56:11 SLOT-2 osaffmd[2189]: NO Current role: STANDBY
.......
Apr 26 14:56:11 SLOT-2 osafrded[2180]: NO Peer down on node 0x2010f
Apr 26 14:56:11 SLOT-2 osafimmd[2199]: WA IMMND DOWN on active controller 1
detected at standby immd!! 2. Possible failover
.......
Apr 26 14:56:11 SLOT-2 opensaf_reboot: Rebooting remote node in the absence of
PLM is outside the scope of OpenSAF
Apr 26 14:56:11 SLOT-2 osaffmd[2189]: NO Controller Failover: Setting role to
ACTIVE
Apr 26 14:56:11 SLOT-2 osafrded[2180]: NO RDE role set to ACTIVE
Apr 26 14:56:11 SLOT-2 osafrded[2180]: NO Running
'/usr/lib64/opensaf/opensaf_sc_active' with 0 argument(s)
Apr 26 14:56:11 SLOT-2 osafimmd[2199]: NO ACTIVE request
Apr 26 14:56:11 SLOT-2 osaflogd[2224]: NO ACTIVE request
Apr 26 14:56:11 SLOT-2 osafntfd[2234]: NO ACTIVE request
Apr 26 14:56:11 SLOT-2 osafclmd[2244]: NO ACTIVE request
Apr 26 14:56:11 SLOT-2 osafamfd[2254]: NO FAILOVER StandBy --> Active
Apr 26 14:56:11 SLOT-2 osafamfnd[2264]: NO AVD NEW_ACTIVE, adest:1
Apr 26 14:56:11 SLOT-2 osafimmd[2199]: NO ellect_coord invoke from rda_callback
ACTIVE
Apr 26 14:56:11 SLOT-2 osafimmd[2199]: NO New coord elected, resides at 2020f
Apr 26 14:56:11 SLOT-2 osafimmnd[2210]: NO 2PBE configured,
IMMSV_PBE_FILE_SUFFIX:.2020f (sync)
Apr 26 14:56:11 SLOT-2 osafimmnd[2210]: NO This IMMND is now the NEW Coord
Apr 26 14:56:11 SLOT-2 osafimmnd[2210]: NO SETTING COORD TO 1 CLOUD PROTO
Apr 26 14:56:11 SLOT-2 osafimmnd[2210]: NO Implementer disconnected 16 <139,
2020f> (@safAmfService2020f)
Apr 26 14:56:11 SLOT-2 osafimmnd[2210]: NO Implementer connected: 18
(safLogService) <126, 2020f>
Apr 26 14:56:11 SLOT-2 osafimmnd[2210]: NO Implementer connected: 19
(safAmfService) <139, 2020f>
Apr 26 14:56:11 SLOT-2 osafamfd[2254]: NO Node 'SC-1' left the cluster
Apr 26 14:56:11 SLOT-2 osafamfd[2254]: NO FAILOVER StandBy --> Active DONE!
Apr 26 14:56:11 SLOT-2 osafamfnd[2264]: NO Assigning
'safSi=SC-2N,safApp=OpenSAF' ACTIVE to 'safSu=SC-2,safSg=2N,safApp=OpenSAF'
Apr 26 14:56:11 SLOT-2 osafntfimcnd[2419]: NO exiting on signal 15
......
Apr 26 14:56:11 SLOT-2 osafimmnd[2210]: NO Implementer connected: 27
(safSmfService) <337, 2020f>
Apr 26 14:56:11 SLOT-2 osafamfnd[2264]: NO Assigned
'safSi=SC-2N,safApp=OpenSAF' ACTIVE to 'safSu=SC-2,safSg=2N,safApp=OpenSAF'
Apr 26 14:56:11 SLOT-2 osafamfd[2254]: ER Wrong rootCauseEntity �H�
Apr 26 14:56:11 SLOT-2 osafamfd[2254]: clm.cc:312: clm_track_cb: Assertion '0'
failed.
Apr 26 14:56:11 SLOT-2 osafamfnd[2264]: WA AMF director unexpectedly crashed
Apr 26 14:56:11 SLOT-2 osafamfnd[2264]: Rebooting OpenSAF NodeId = 131599 EE
Name = , Reason: local AVD down(Adest) or both AVD down(Vdest) received,
OwnNodeId = 131599, SupervisionTime = 60
Apr 26 14:56:11 SLOT-2 opensaf_reboot: Rebooting local node; timeout=60
* Syslog and amfd trace attached
Note: The issue is observed randomly
---
Sent from sourceforge.net because [email protected] is
subscribed to https://sourceforge.net/p/opensaf/tickets/
To unsubscribe from further messages, a project admin can change settings at
https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a
mailing list, you can unsubscribe from the mailing list.
------------------------------------------------------------------------------
What NetFlow Analyzer can do for you? Monitors network bandwidth and traffic
patterns at an interface-level. Reveals which users, apps, and protocols are
consuming the most bandwidth. Provides multi-vendor support for NetFlow,
J-Flow, sFlow and other flows. Make informed decisions using capacity
planning reports. https://ad.doubleclick.net/ddm/clk/305295220;132659582;e
_______________________________________________
Opensaf-tickets mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/opensaf-tickets