Hi,

Following is the analysis of the shared logs:

1. Both controllers lost contact with each other.

scm1
----

113141:2014-02-25T22:40:28.111835+00:00 scm1 osafimmd[2344]: NO SBY: New
Epoch for IMMND process at node 1130f old epoch: 0  new epoch:27
  113187:2014-02-25T22:47:28.525892+00:00 scm1 osaffmd[2334]: NO Role:
STANDBY, Node Down for node id: 1100f
  113188:2014-02-25T22:47:28.525952+00:00 scm1 osaffmd[2334]: Rebooting
OpenSAF NodeId = 69647 EE Name = , Reason: Received Node Down for peer
controller, OwnNodeId = 69391, SupervisionTime = 60
  113189:2014-02-25T22:47:28.525975+00:00 scm1 osafimmd[2344]: WA IMMND
DOWN on active controller 100 detected at standby immd!! ff. Possible
failover

scm2
----

165136:2014-02-25T22:47:30.472938+00:00 scm2 osafimmd[2272]: WA IMMD
lost contact with peer IMMD (NCSMDS_RED_DOWN)
  165137:2014-02-25T22:47:30.473003+00:00 scm2 osaffmd[2262]: NO Role:
ACTIVE, Node Down for node id: 10f0f
  165138:2014-02-25T22:47:30.473026+00:00 scm2 osaffmd[2262]: Rebooting
OpenSAF NodeId = 69391 EE Name = , Reason: Received Node Down for peer
controller, OwnNodeId = 69647, SupervisionTime = 60
  165139:2014-02-25T22:47:30.473051+00:00 scm2 osafimmnd[2282]: NO
Global discard node received for nodeId:10f0f pid:2354
  165140:2014-02-25T22:47:30.473073+00:00 scm2 osafimmnd[2282]: NO
Implementer disconnected 59 <0, 10f0f(down)> (@s

2. OpenSAF does not support loss of the link between the controllers.
This is a split-brain case, in which both nodes try to become active.
Please check why the link was lost.
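
To make the correlation between the two log excerpts easier to follow,
here is a small Python sketch. It only uses values copied from the
osaffmd lines above: the decimal NodeId/OwnNodeId fields and the hex
"node id" fields. The dictionary layout and the helper name node_hex
are my own, purely for illustration; this is not an OpenSAF tool.

```python
from datetime import datetime

# Values copied by hand from the osaffmd "Node Down" log lines above.
# "own" is the decimal OwnNodeId, "peer_down" is the hex node id that
# the controller reported as down.
events = {
    "scm1": {"ts": "2014-02-25T22:47:28.525892+00:00",
             "own": 69391, "peer_down": "1100f"},
    "scm2": {"ts": "2014-02-25T22:47:30.472938+00:00",
             "own": 69647, "peer_down": "10f0f"},
}


def node_hex(node_id):
    """Return a decimal node id as the lowercase hex string used in the logs."""
    return format(node_id, "x")


# Each controller should report the *other* controller's own id as down.
mutual = (
    events["scm1"]["peer_down"] == node_hex(events["scm2"]["own"])
    and events["scm2"]["peer_down"] == node_hex(events["scm1"]["own"])
)

# Time between the two "Node Down" events, in seconds.
t1 = datetime.fromisoformat(events["scm1"]["ts"])
t2 = datetime.fromisoformat(events["scm2"]["ts"])
delta_seconds = (t2 - t1).total_seconds()

print("each controller saw the other go down:", mutual)
print("seconds between the two events:", delta_seconds)
```

Decimal 69391 is hex 10f0f (scm1) and decimal 69647 is hex 1100f
(scm2), so each controller declared the other one down within about
two seconds, which is what I would expect to see when the link between
them drops.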


/Neel.


On Wednesday 26 February 2014 10:17 PM, Tony Hart wrote:
> Hi Mathi,
>
> I’ve attached the logs of the two controllers from the recent failure.  The 
> 'Quiesced FAILED’ log occurred on scm1 at 2014-02-25T22:47:57.  It seems that 
> around that time scm2 was seeing a lot of osaf errors.  It almost seems like 
> communication was lost between both controllers?
>
>   115145:2014-02-25T22:47:57.992250+00:00 scm1 osafamfd[2502]: ER FAILOVER 
> Active --> Quiesced FAILED, ImplementerClear failed 5
>




_______________________________________________
Opensaf-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/opensaf-users
