[tickets] [opensaf:tickets] #2485 amfnd: missing susi response if component is restarted
---

**[tickets:#2485] amfnd: missing susi response if component is restarted**

**Status:** unassigned
**Milestone:** 5.17.06
**Created:** Wed Jun 07, 2017 12:57 AM UTC by Gary Lee
**Last Updated:** Wed Jun 07, 2017 12:57 AM UTC
**Owner:** nobody

An SI contains multiple CSIs. If a restart component admin operation arrives at amfnd before all CSIs are assigned, the SUSI response is not sent to AMFD.

This code in avnd_comp_csi_assign_done() appears to be the problem area.

    /* while restarting, we wont use assign all, so csi will not be null */
    if (csi && m_AVND_COMP_CSI_CURR_ASSIGN_STATE_IS_RESTARTING(csi)) {
      m_AVND_COMP_CSI_CURR_ASSIGN_STATE_SET(csi, AVND_COMP_CSI_ASSIGN_STATE_ASSIGNED);
      goto done;
    }

Perhaps we should not initiate a restart in avnd_evt_comp_admin_op_req(), if a component is still in AVND_COMP_CSI_ASSIGN_STATE_ASSIGNING state.

---

Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is subscribed to https://sourceforge.net/p/opensaf/tickets/

To unsubscribe from further messages, a project admin can change settings at https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a mailing list, you can unsubscribe from the mailing list.

Opensaf-tickets mailing list
Opensaf-tickets@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/opensaf-tickets
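The suggestion in ticket #2485 (do not start an admin restart while a CSI is still being assigned) could be sketched roughly as follows. This is only an illustration: the types `csi_rec_t` and `comp_t`, the state enum, and both functions are hypothetical stand-ins, not the real amfnd structures or macros, and the actual fix may look quite different.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical stand-ins for the amfnd CSI assignment states. */
typedef enum {
  CSI_UNASSIGNED,
  CSI_ASSIGNING,
  CSI_ASSIGNED,
  CSI_RESTARTING
} csi_assign_state_t;

/* Hypothetical CSI record; the real amfnd record holds much more. */
typedef struct {
  csi_assign_state_t state;
} csi_rec_t;

/* Hypothetical component holding an array of CSI records. */
typedef struct {
  csi_rec_t *csis;
  size_t num_csis;
} comp_t;

/* Returns true if at least one CSI of the component is still being assigned. */
bool comp_has_csi_assigning(const comp_t *comp)
{
  assert(comp != NULL);
  for (size_t i = 0; i < comp->num_csis; i++) {
    if (comp->csis[i].state == CSI_ASSIGNING)
      return true;
  }
  return false;
}

/* An admin restart is allowed only when no CSI assignment is in flight.
 * A caller would reject or defer the admin operation otherwise. */
bool comp_admin_restart_allowed(const comp_t *comp)
{
  return !comp_has_csi_assigning(comp);
}
```

With such a check in the admin operation handler, a restart request that arrives mid-assignment would be turned away before any CSI is moved to the restarting state, so the pending SUSI response would not be skipped.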
[tickets] [opensaf:tickets] #2484 imm: Testsuit 7 of immoitest fails with ERR_TRY_AGAIN
- **status**: accepted --> review

---

**[tickets:#2484] imm: Testsuit 7 of immoitest fails with ERR_TRY_AGAIN**

**Status:** review
**Milestone:** 5.17.06
**Created:** Tue Jun 06, 2017 04:25 AM UTC by Hung Nguyen
**Last Updated:** Tue Jun 06, 2017 04:25 AM UTC
**Owner:** Hung Nguyen

~~~
# immoitest --longDn 7
Suite 7: Long DN
    1  PASSED   SA_AIS_OK - Object create callback;
    2  PASSED   SA_AIS_OK - Object modify callback;
    3  PASSED   SA_AIS_OK - Object delete callback;
    4  PASSED   SA_AIS_OK - Rt Object create and delete;
error: in src/imm/apitest/implementer/test_saImmOiLongDn.c at 500: SA_AIS_ERR_TRY_AGAIN (6), expected SA_AIS_OK (1) - exiting
~~~

The test cases of testsuite 7 use the same implementer name. At the end of each test case, the implementer name is not explicitly cleared with saImmOiImplementerClear(). Instead, saImmOiFinalize() is called.

The difference between saImmOiImplementerClear() and saImmOiFinalize() is:

- saImmOiImplementerClear() returns after the implementer is fully discarded on all nodes
- saImmOiFinalize() returns after the implementer is locally discarded on the originating node (no guarantee of being fully discarded)

So if you set the same implementer name after saImmOiFinalize(), chances are you will get ERR_TRY_AGAIN.
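One way a test could cope with the race described in ticket #2484 is to retry the implementer-set step while it reports TRY_AGAIN. The sketch below is not the fix applied for this ticket and does not call the real IMM OI API: `rc_t`, `set_impl_fn`, `set_implementer_with_retry()` and `mock_set_impl()` are hypothetical names, and only the numeric values 1 and 6 are taken from the test output above.

```c
#include <assert.h>
#include <stddef.h>

/* Local stand-ins for the return codes; the values match the test output
 * quoted in the ticket (SA_AIS_OK = 1, SA_AIS_ERR_TRY_AGAIN = 6). */
typedef enum { RC_OK = 1, RC_ERR_TRY_AGAIN = 6 } rc_t;

/* Hypothetical callback that wraps the real implementer-set call. */
typedef rc_t (*set_impl_fn)(void *ctx);

/* Calls set_impl until it stops returning TRY_AGAIN or max_tries calls
 * have been made. backoff (optional) runs between attempts. */
rc_t set_implementer_with_retry(set_impl_fn set_impl, void *ctx,
                                unsigned max_tries, void (*backoff)(void))
{
  assert(set_impl != NULL);
  rc_t rc = RC_ERR_TRY_AGAIN;
  for (unsigned i = 0; i < max_tries && rc == RC_ERR_TRY_AGAIN; i++) {
    if (i > 0 && backoff != NULL)
      backoff();
    rc = set_impl(ctx);
  }
  return rc;
}

/* Mock: reports TRY_AGAIN while the previous implementer is still being
 * discarded on other nodes. ctx points to the number of pending rounds. */
rc_t mock_set_impl(void *ctx)
{
  int *pending = ctx;
  if (*pending > 0) {
    (*pending)--;
    return RC_ERR_TRY_AGAIN;
  }
  return RC_OK;
}
```

The alternative implied by the ticket text is to call saImmOiImplementerClear() at the end of each test case, which avoids the retry altogether because it returns only after the implementer is discarded on all nodes.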
[tickets] [opensaf:tickets] #2478 clmtest: failed test cases
- **status**: review --> fixed
- **Milestone**: 5.17.08 --> 5.17.06
- **Comment**:

5.17.08 (develop) [code:664e46]

~~~
commit 664e46e54ef3c4b2e11b84e5efc73916ba6f6fc2
Author: Hoang Vo
Date:   Tue Jun 6 17:39:02 2017 +0700

    clmtest: Correct test case following AIS [#2478]

    Correct test case following AIS.
~~~

5.17.06 (release) [code:360bb7]

~~~
commit 360bb7dee0b8d85ae7e25d70ba1c7d7eab034b8a
Author: Hoang Vo
Date:   Tue Jun 6 17:39:02 2017 +0700

    clmtest: Correct test case following AIS [#2478]

    Correct test case following AIS.
~~~

---

**[tickets:#2478] clmtest: failed test cases**

**Status:** fixed
**Milestone:** 5.17.06
**Created:** Fri Jun 02, 2017 06:20 AM UTC by Vo Minh Hoang
**Last Updated:** Fri Jun 02, 2017 07:32 AM UTC
**Owner:** Vo Minh Hoang

Test cases failed due to inconsistency with the specification:

- Suite 2 - saClmSelectionObjectGet NULL handle
- Suite 5 - saClmClusterNodeGet & saClmClusterNodeGet_4 with NULL timeout
- Suite 5 - saClmClusterNodeGet & saClmClusterNodeGet_4 with nodeId of non member node
- Suite 7 - saClmClusterTrack_4 with null handle or invalid handle
- Suite 7 - saClmClusterTrack_4 with null flags
[tickets] [opensaf:tickets] #2468 amf: amfd asserts while decrementing opensaf NoRed SI assignment counter during fail-over.
Hi,

In the later releases, the assert has been replaced with a warning. Without amfd traces, it is not possible to know why the counter was decremented before the assert. I will try to reproduce based on code analysis and will update further.

Thanks
Praveen

---

**[tickets:#2468] amf: amfd asserts while decrementing opensaf NoRed SI assignment counter during fail-over.**

**Status:** unassigned
**Milestone:** 5.17.06
**Created:** Thu May 25, 2017 08:46 AM UTC by Praveen
**Last Updated:** Thu Jun 01, 2017 02:41 PM UTC
**Owner:** nobody

The ticket is based on an issue reported via a user list mail dated 22-May-17, subject "[users] osafamfd coredump issue".

Here is the syslog from when the issue occurred:

    2017-05-01T07:52:57.714906-04:00 scm2 kernel: tipc: Resetting link <1.1.16:eth2-1.1.5:bond0>, peer not responding
    2017-05-01T07:52:57.714935-04:00 scm2 kernel: tipc: Lost link <1.1.16:eth2-1.1.5:bond0> on network plane A
    2017-05-01T07:52:57.714939-04:00 scm2 kernel: tipc: Lost contact with <1.1.5>
    2017-05-01T07:52:57.716788-04:00 scm2 osafimmd[3009]: NO MDS event from svc_id 25 (change:4, dest:287038266327043)
    2017-05-01T07:52:57.717304-04:00 scm2 osafclmd[4259]: NO Node 66831 went down. Not sending track callback for agents on that node
    2017-05-01T07:52:57.719178-04:00 scm2 osafimmnd[3020]: NO Global discard node received for nodeId:1050f pid:15395
    2017-05-01T07:52:57.719233-04:00 scm2 osafimmnd[3020]: NO Implementer disconnected 104 <0, 1050f(down)> (MsgQueueService66831)
    2017-05-01T07:52:57.721345-04:00 scm2 osafamfd[4277]: NO Node 'PLD0105' left the cluster
    2017-05-01T07:52:57.722778-04:00 scm2 log_demo[6160]: [0.I.Proc]: FYI state change notification from NTF, entity PLD0105 now has new state DISABLED (Oper state safAmfNode=PLD0105,safAmfCluster=myAmfCluster changed)
    2017-05-01T07:52:57.732796-04:00 scm2 osafamfd[4277]: su.cc:2006: dec_curr_act_si: Assertion 'saAmfSUNumCurrActiveSIs > 0' failed.
    2017-05-01T07:52:57.778777-04:00 scm2 kernel: tipc: Resetting link <1.1.16:eth2-1.1.6:bond0>, peer not responding
    2017-05-01T07:52:57.778827-04:00 scm2 kernel: tipc: Lost link <1.1.16:eth2-1.1.6:bond0> on network plane A
    2017-05-01T07:52:57.778833-04:00 scm2 kernel: tipc: Lost contact with <1.1.6>
    2017-05-01T07:52:57.777979-04:00 scm2 osafimmd[3009]: NO MDS event from svc_id 25 (change:4, dest:288139774320643)
    2017-05-01T07:52:57.717343-04:00 scm2 osafclmd[4259]: NO Node 66831 went down. Not sending track callback for agents on that node
    2017-05-01T07:52:57.779373-04:00 scm2 osafclmd[4259]: NO Node 67087 went down. Not sending track callback for agents on that node
    2017-05-01T07:52:57.780552-04:00 scm2 osafimmnd[3020]: NO Global discard node received for nodeId:1060f pid:17439
    2017-05-01T07:52:57.780607-04:00 scm2 osafimmnd[3020]: NO Implementer disconnected 106 <0, 1060f(down)> (MsgQueueService67087)
    2017-05-01T07:52:57.810785-04:00 scm2 osafamfnd[5281]: WA AMF director unexpectedly crashed
    2017-05-01T07:52:57.810839-04:00 scm2 osafamfnd[5281]: Rebooting OpenSAF NodeId = 69647 EE Name = , Reason: local AVD down(Adest) or both AVD down(Vdest) received, OwnNodeId = 69647, SupervisionTime = 0
    2017-05-01T07:52:57.810978-04:00 scm2 osafimmnd[3020]: NO Implementer locally disconnected. Marking it as doomed 105 <29, 1100f> (safAmfService)
    2017-05-01T07:52:57.812582-04:00 scm2 osafimmnd[3020]: NO Implementer disconnected 105 <29, 1100f> (safAmfService)
    2017-05-01T07:52:57.950567-04:00 scm2 opensaf_reboot: Rebooting local node; timeout=0
    2017-05-01T07:52:58.084968-04:00 scm2 atwdog[28335]: rebooting (-f) local node
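The comment on ticket #2468 notes that later releases replace the assert with a warning. A minimal sketch of that pattern is shown here, assuming an unsigned counter that must not drop below zero. The `su_t` type and `su_dec_curr_act_si()` are hypothetical names loosely modelled on the `dec_curr_act_si` and `saAmfSUNumCurrActiveSIs` identifiers in the syslog; the actual amfd code and its warning text are not shown in this ticket.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdio.h>

/* Hypothetical stand-in for a service unit and its active-SI counter. */
typedef struct {
  unsigned num_curr_active_sis;
} su_t;

/* Decrements the active-SI counter. If the counter is already zero, a
 * warning is printed and the counter is left untouched instead of
 * aborting the process. Returns false in that case, true otherwise. */
bool su_dec_curr_act_si(su_t *su)
{
  assert(su != NULL);
  if (su->num_curr_active_sis == 0) {
    /* Illustrative message only; not the text used by amfd. */
    fprintf(stderr, "warning: active SI counter is already 0, not decrementing\n");
    return false;
  }
  su->num_curr_active_sis--;
  return true;
}
```

Warning instead of asserting keeps the director running during fail-over, but it also hides the unexpected extra decrement, which is why traces are still needed to find the root cause.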