[tickets] [opensaf:tickets] #2747 ntf: ntfd abort with theNtfAdmin is NULL
- **status**: accepted --> review

---

** [tickets:#2747] ntf: ntfd abort with theNtfAdmin is NULL**

**Status:** review
**Milestone:** 5.18.01
**Created:** Thu Dec 21, 2017 09:42 AM UTC by Canh Truong
**Last Updated:** Thu Dec 21, 2017 09:42 AM UTC
**Owner:** Canh Truong

2017-12-21 01:08:23.207 SC-1 osafsmfnd[347]: NO MDS mds_svc_event: NCSMDS_DOWN smfd_dest = 0
2017-12-21 01:08:23.230 SC-1 osaffmd[182]: NO AMFND down on: 2020f
2017-12-21 01:08:23.232 SC-1 osafntfd[232]: src/ntf/ntfd/NtfAdmin.cc:1053: SetClientsDownFlag: Assertion 'NtfAdmin::theNtfAdmin != NULL' failed.

The ntfa down event is received while ntfd is starting, before NtfAdmin has been created.

---

Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is subscribed to https://sourceforge.net/p/opensaf/tickets/

To unsubscribe from further messages, a project admin can change settings at https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a mailing list, you can unsubscribe from the mailing list.

--
Check out the vibrant tech community on one of the world's most engaging tech sites, Slashdot.org! http://sdm.link/slashdot
___
Opensaf-tickets mailing list
Opensaf-tickets@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/opensaf-tickets
[tickets] [opensaf:tickets] #2737 amfd: Fail to saImmOiInitialize causes unexpected reboot in node shutting down
- **status**: review --> fixed
- **assigned_to**: Minh Hon Chau --> nobody
- **Comment**:

[develop]
commit d231ba43f36ed056c6b83a5739d7aa166ea73317
Author: Minh Chau
Date: Thu Dec 21 12:06:35 2017 +1100

amfd: Avoid IMM reinitialization in OpenSAF components termination phase [#2737]

[release]
commit 23052a89e1fd6080d640bd82ab3795369ee3c12a
Author: Minh Chau
Date: Thu Dec 21 12:06:35 2017 +1100

amfd: Avoid IMM reinitialization in OpenSAF components termination phase [#2737]

---

** [tickets:#2737] amfd: Fail to saImmOiInitialize causes unexpected reboot in node shutting down **

**Status:** fixed
**Milestone:** 5.18.01
**Created:** Tue Dec 12, 2017 10:52 AM UTC by Minh Hon Chau
**Last Updated:** Tue Dec 19, 2017 02:15 AM UTC
**Owner:** nobody

When the active controller is shut down, the local amfnd terminates the OpenSAF components. As a result, amfd gets a bad handle and tries to reinitialize the service handles. At some point, amfd gets SA_AIS_ERR_LIBRARY returned from saImmOiInitialize() and exits. This causes an unexpected node reboot.

2017-12-07 21:14:14.663 SC-2 systemd[1]: Stopping OpenSAF daemon...
2017-12-07 21:14:14.675 SC-2 opensafd: Stopping OpenSAF Services
2017-12-07 21:14:14.682 SC-2 osafamfnd[262]: NO Shutdown initiated
2017-12-07 21:14:14.685 SC-2 osafamfnd[262]: NO Terminating all AMF components
2017-12-07 21:14:15.507 SC-2 osafimmd[196]: NO MDS event from svc_id 25 (change:4, dest:567412424442034)
2017-12-07 21:14:15.508 SC-2 osafimmnd[207]: NO Global discard node received for nodeId:2040f pid:178
2017-12-07 21:14:15.521 SC-2 osafimmd[196]: NO MDS event from svc_id 25 (change:4, dest:568511936069806)
2017-12-07 21:14:15.521 SC-2 osafimmnd[207]: NO Global discard node received for nodeId:2050f pid:174
2017-12-07 21:14:15.679 SC-2 osafimmd[196]: NO MDS event from svc_id 25 (change:4, dest:566312912814258)
2017-12-07 21:14:15.679 SC-2 osafimmnd[207]: NO Global discard node received for nodeId:2030f pid:178
2017-12-07 21:14:16.013 SC-2 osaffmd[186]: NO IMMD down on: 2010f
2017-12-07 21:14:16.013 SC-2 osafimmd[196]: NO MDS event from svc_id 24 (change:1, dest:13)
2017-12-07 21:14:16.013 SC-2 osafimmd[196]: NO MDS event from svc_id 24 (change:6, dest:13)
2017-12-07 21:14:16.013 SC-2 osafimmd[196]: WA IMMD lost contact with peer IMMD (NCSMDS_RED_DOWN)
2017-12-07 21:14:16.014 SC-2 osafimmnd[207]: NO Implementer disconnected 10 <0, 2010f> (@safSmf_applier1)
2017-12-07 21:14:16.014 SC-2 osafimmnd[207]: WA DISCARD DUPLICATE FEVS message:1246
2017-12-07 21:14:16.014 SC-2 osafimmnd[207]: WA Error code 2 returned for message type 82 - ignoring
2017-12-07 21:14:16.014 SC-2 osafimmnd[207]: WA DISCARD DUPLICATE FEVS message:1247
2017-12-07 21:14:16.014 SC-2 osafimmnd[207]: WA Error code 2 returned for message type 82 - ignoring
2017-12-07 21:14:16.056 SC-2 osafdtmd[150]: NO Lost contact with 'PL-5'
2017-12-07 21:14:16.058 SC-2 osafrded[177]: NO Peer down on node 0x2010f
2017-12-07 21:14:16.085 SC-2 osafdtmd[150]: NO Lost contact with 'PL-4'
2017-12-07 21:14:16.108 SC-2 osafdtmd[150]: NO Lost contact with 'PL-3'
2017-12-07 21:14:16.167 SC-2 osafamfwd[342]: exiting for shutdown
2017-12-07 21:14:16.168 SC-2 osafclmd[242]: exiting for shutdown
2017-12-07 21:14:16.170 SC-2 osafrded[177]: exiting for shutdown
2017-12-07 21:14:16.171 SC-2 osafclmna[168]: exiting for shutdown
2017-12-07 21:14:16.172 SC-2 osafsmfd[284]: exiting for shutdown
2017-12-07 21:14:16.177 SC-2 osafsmfnd[277]: exiting for shutdown
2017-12-07 21:14:16.219 SC-2 osaffmd[186]: NO FM down on: 2010f
2017-12-07 21:14:16.220 SC-2 osaffmd[186]: NO IMMND down on: 2010f
2017-12-07 21:14:16.220 SC-2 osafimmd[196]: NO MDS event from svc_id 25 (change:4, dest:564113889558735)
2017-12-07 21:14:16.220 SC-2 osafimmd[196]: WA IMMND DOWN on active controller 1 detected at standby immd!! 2. Possible failover
2017-12-07 21:14:16.221 SC-2 osafimmd[196]: NO Skipping re-send of fevs message 1246 since it has recently been resent.
2017-12-07 21:14:16.221 SC-2 osafimmd[196]: NO Skipping re-send of fevs message 1247 since it has recently been resent.
2017-12-07 21:14:16.221 SC-2 osafimmnd[207]: NO Global discard node received for nodeId:2010f pid:207
2017-12-07 21:14:16.221 SC-2 osafimmnd[207]: NO Implementer disconnected 1 <0, 2010f(down)> (safLogService)
2017-12-07 21:14:16.221 SC-2 osafimmnd[207]: NO Implementer disconnected 2 <0, 2010f(down)> (@safLogService_appl)
2017-12-07 21:14:16.221 SC-2 osafimmnd[207]: NO Implementer disconnected 3 <0, 2010f(down)> (@OpenSafImmReplicatorA)
2017-12-07 21:14:16.221 SC-2 osafimmnd[207]: NO Implementer disconnected 4 <0, 2010f(down)> (safClmService)
2017-12-07 21:14:16.221 SC-2 osafimmnd[207]: NO Implementer disconnected 5 <0, 2010f(down)> (safAmfService)
2017-12-07 21:14:16.221 SC-2 osafimmnd[207]: NO Implementer disconnected 6 <0, 2010f(down)> (OpenSafImmPBE)
2017-12-07 21:14:16.221 SC-2 osafimmnd[207]: NO Implementer disconnected 8
[tickets] [opensaf:tickets] #2744 build: lib directory may not exist when all-local is built
- **status**: review --> fixed
- **Comment**:

commit 9134d5d6d87a374546b669a1e59b0ce9fb340996 (HEAD -> develop, origin/develop)
Author: Anders Widell
Date: Thu Dec 21 13:54:16 2017 +0100

build: Add missing mkdir in toplevel Makefile.am [#2744]

In a parallel build, the $(top_builddir)/lib directory may not yet exist when building the all-local target. Make sure the directory exists using mkdir -p.

---

** [tickets:#2744] build: lib directory may not exist when all-local is built**

**Status:** fixed
**Milestone:** 5.18.01
**Created:** Wed Dec 20, 2017 01:09 PM UTC by Anders Widell
**Last Updated:** Thu Dec 21, 2017 11:33 AM UTC
**Owner:** Anders Widell

When running a parallel build, $(top_builddir)/lib may not yet exist when the all-local target is built.
[tickets] [opensaf:tickets] #2634 clm: clmtest does not handle SA_AIS_ERR_TRY_AGAIN
- **status**: accepted --> review

---

** [tickets:#2634] clm: clmtest does not handle SA_AIS_ERR_TRY_AGAIN**

**Status:** review
**Milestone:** 5.18.01
**Created:** Wed Oct 18, 2017 03:47 AM UTC by Vu Minh Nguyen
**Last Updated:** Fri Nov 03, 2017 09:50 PM UTC
**Owner:** Vu Minh Nguyen

Running `clmtest` sometimes fails because it does not handle `SA_AIS_ERR_TRY_AGAIN`.

> Suite 1: Life Cykel API
> 1 PASSED saClmInitialize with A.01.01 SA_AIS_OK
> 2 PASSED saClmInitialize_4 with A.04.01 SA_AIS_OK
> 3 PASSED saClmInitialize with NULL pointer to handle
> 4 PASSED saClmInitialize_4 with NULL pointer to handle
> 5 PASSED saClmInitialize with NULL pointer to callback
> error: in src/clm/apitest/tet_saClmInitialize.c at 56: SA_AIS_ERR_BAD_HANDLE (9), expected SA_AIS_OK (1) - exiting

This ticket intends to create wrappers for the CLM APIs that handle the try-again error code internally, in the same way as the `immutil_` functions.
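The wrapper idea in ticket #2634 can be illustrated with a minimal Python sketch. It is not the clmtest change itself (clmtest is written in C), and the names `call_with_try_again`, `FakeClm`, `max_tries` and `delay_s` are hypothetical. The numeric values follow the SA Forum `SaAisErrorT` enumeration (`SA_AIS_OK` is 1, `SA_AIS_ERR_TRY_AGAIN` is 6).

```python
import time

SA_AIS_OK = 1
SA_AIS_ERR_TRY_AGAIN = 6


def call_with_try_again(api_call, *args, max_tries=10, delay_s=0.01):
    """Call api_call(*args) again while it returns SA_AIS_ERR_TRY_AGAIN.

    Gives up after max_tries calls and returns the last return code.
    (Hypothetical helper; the retry limits are assumptions.)
    """
    rc = api_call(*args)
    tries = 1
    while rc == SA_AIS_ERR_TRY_AGAIN and tries < max_tries:
        time.sleep(delay_s)
        rc = api_call(*args)
        tries += 1
    return rc


class FakeClm:
    """Stand-in for a CLM API: reports try-again for the first `busy` calls."""

    def __init__(self, busy):
        self.busy = busy
        self.calls = 0

    def initialize(self):
        self.calls += 1
        if self.calls <= self.busy:
            return SA_AIS_ERR_TRY_AGAIN
        return SA_AIS_OK
```

A test case would then call `call_with_try_again(fake.initialize)` instead of `fake.initialize()` and compare the result with `SA_AIS_OK`, so a transient try-again no longer fails the suite.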
[tickets] [opensaf:tickets] #2748 IMM: IMMND asserts as AdminOwner Id is missing on one of the Payload
---

** [tickets:#2748] IMM: IMMND asserts as AdminOwner Id is missing on one of the Payload**

**Status:** unassigned
**Milestone:** 5.18.01
**Created:** Thu Dec 21, 2017 11:56 AM UTC by Ravi Sekhar Reddy
**Last Updated:** Thu Dec 21, 2017 11:56 AM UTC
**Owner:** nobody

In a multi-node setup, when a payload node reboots, a sync is triggered to synchronize the IMM database. During the sync, every node in the cluster other than the node being synced is in the IMM_NODE_R_AVAILABLE state. If an object is added during this time, the active IMMD sends an AdminOwnerSync init request to all nodes. Because the nodes are in the IMM_NODE_R_AVAILABLE state, IMMND does not add the AdminOwner for the object to its database. During VerifySync, the admin owner is then missing in the payload IMMND, and IMMND aborts on an assertion.
[tickets] [opensaf:tickets] #2743 ais: change default retry in decorator
- **status**: review --> fixed
- **assigned_to**: Vu Minh Nguyen --> nobody
- **Comment**:

commit a1834165152850d792823ef01bed473c96beb2e9 (HEAD, origin/develop, ticket-2743, develop)
Author: Vu Minh Nguyen
Date: Thu Dec 21 18:39:55 2017 +0700

ais: change default retry in decorator [#2743]

Change interval time to 100 milliseconds, and timeout to one minute for the default retry control.

---

** [tickets:#2743] ais: change default retry in decorator**

**Status:** fixed
**Milestone:** 5.18.01
**Created:** Wed Dec 20, 2017 11:58 AM UTC by Vu Minh Nguyen
**Last Updated:** Wed Dec 20, 2017 12:37 PM UTC
**Owner:** nobody

Change interval time to 100 milliseconds, and timeout to one minute for the default retry control.
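To make the defaults in ticket #2743 concrete, here is a rough sketch of a time-bounded retry decorator using a 100 millisecond interval and a one minute timeout. It is an illustration only and not the decorator from the OpenSAF Python bindings: `retry_on_try_again`, the constant names, and the injectable `clock` and `sleep` parameters are all assumptions made for this example.

```python
import functools
import time

SA_AIS_OK = 1
SA_AIS_ERR_TRY_AGAIN = 6

DEFAULT_INTERVAL_S = 0.1   # 100 milliseconds, as stated in the ticket
DEFAULT_TIMEOUT_S = 60.0   # one minute, as stated in the ticket


def retry_on_try_again(interval=DEFAULT_INTERVAL_S, timeout=DEFAULT_TIMEOUT_S,
                       clock=time.monotonic, sleep=time.sleep):
    """Hypothetical decorator: repeat the call while it returns try-again.

    The wrapped function is expected to return an AIS return code.
    clock and sleep are injectable so the behaviour can be checked
    without really waiting.
    """
    def decorator(func):
        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            deadline = clock() + timeout
            rc = func(*args, **kwargs)
            # Retry until the call stops reporting try-again or time runs out.
            while rc == SA_AIS_ERR_TRY_AGAIN and clock() < deadline:
                sleep(interval)
                rc = func(*args, **kwargs)
            return rc
        return wrapper
    return decorator
```

Applied as `@retry_on_try_again()` on a function that returns an AIS return code, the call is repeated every 100 milliseconds and the last return code is handed back once a minute has elapsed.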
[tickets] [opensaf:tickets] #2744 build: lib directory may not exist when all-local is built
- **status**: accepted --> review

---

** [tickets:#2744] build: lib directory may not exist when all-local is built**

**Status:** review
**Milestone:** 5.18.01
**Created:** Wed Dec 20, 2017 01:09 PM UTC by Anders Widell
**Last Updated:** Wed Dec 20, 2017 01:09 PM UTC
**Owner:** Anders Widell

When running a parallel build, $(top_builddir)/lib may not yet exist when the all-local target is built.
[tickets] [opensaf:tickets] #2738 base: Consolidate code for reading node ID
- **status**: review --> fixed
- **Comment**:

commit ec8a376bdeb75f0ba0d418ba7d9e8f16958b34c3 (HEAD -> develop)
Author: Anders Widell
Date: Thu Dec 21 11:07:59 2017 +0100

base: Clean up and remove dead code around ncs_get_node_id() [#2738]

commit 238d072d8bdc020b9fcc6b6321dd17d139506a35
Author: Anders Widell
Date: Thu Dec 21 11:07:59 2017 +0100

clm: Use ncs_get_node_id() to read the Node ID [#2738]

commit ce827d7a8d3052f6408260d919a92d5039e7ebd6
Author: Anders Widell
Date: Thu Dec 21 11:07:59 2017 +0100

mds: Use ncs_get_node_id() to read the Node ID [#2738]

---

** [tickets:#2738] base: Consolidate code for reading node ID**

**Status:** fixed
**Milestone:** 5.18.01
**Created:** Thu Dec 14, 2017 02:26 PM UTC by Anders Widell
**Last Updated:** Thu Dec 14, 2017 03:57 PM UTC
**Owner:** Anders Widell

As a preparation for ticket [#1755], consolidate the code used to read the Node ID, and make sure all places in OpenSAF use the function ncs_get_node_id() instead of some other mechanism for reading it.
[tickets] [opensaf:tickets] #2747 ntf: ntfd abort with theNtfAdmin is NULL
---

** [tickets:#2747] ntf: ntfd abort with theNtfAdmin is NULL**

**Status:** accepted
**Milestone:** 5.18.01
**Created:** Thu Dec 21, 2017 09:42 AM UTC by Canh Truong
**Last Updated:** Thu Dec 21, 2017 09:42 AM UTC
**Owner:** Canh Truong

2017-12-21 01:08:23.207 SC-1 osafsmfnd[347]: NO MDS mds_svc_event: NCSMDS_DOWN smfd_dest = 0
2017-12-21 01:08:23.230 SC-1 osaffmd[182]: NO AMFND down on: 2020f
2017-12-21 01:08:23.232 SC-1 osafntfd[232]: src/ntf/ntfd/NtfAdmin.cc:1053: SetClientsDownFlag: Assertion 'NtfAdmin::theNtfAdmin != NULL' failed.

The ntfa down event is received while ntfd is starting, before NtfAdmin has been created.