[tickets] [opensaf:tickets] #1825 MDS: When saImmOmInitialize was invoked, MDS sent ‘pid = 0’ in MDS_CALLBACK_RECEIVE_INFO.
- Description has changed: Diff: --- old +++ new @@ -8,6 +8,6 @@ May 13 02:31:53 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? May 13 02:31:53 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? - May 13 02:31:55 sc2 osafimmnd[6042]: NO Ccb 176 COMMITTED (LDE) + ... May 13 02:31:55 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? May 13 02:31:55 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? - **Blocker**: --> False --- ** [tickets:#1825] MDS: When saImmOmInitialize was invoked, MDS sent ‘pid = 0’ in MDS_CALLBACK_RECEIVE_INFO.** **Status:** fixed **Milestone:** 4.7.2 **Created:** Fri May 13, 2016 12:34 PM UTC by Rafael Odzakow **Last Updated:** Mon Jun 06, 2016 03:26 AM UTC **Owner:** A V Mahesh (AVM) This happens only some of the time. First this is called from an application: saImmOmInitialize(immHandle, NULL, ) Then the system log shows this error: May 13 02:31:53 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? May 13 02:31:53 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? ... May 13 02:31:55 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? May 13 02:31:55 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? --- Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is subscribed to https://sourceforge.net/p/opensaf/tickets/ To unsubscribe from further messages, a project admin can change settings at https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a mailing list, you can unsubscribe from the mailing list.___ Opensaf-tickets mailing list Opensaf-tickets@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/opensaf-tickets
[tickets] [opensaf:tickets] #1825 MDS: When saImmOmInitialize was invoked, MDS sent ‘pid = 0’ in MDS_CALLBACK_RECEIVE_INFO.
- Attachments has changed: Diff: --- old +++ new @@ -1 +0,0 @@ -sc2_mds.log (2.6 MB; text/x-log) --- ** [tickets:#1825] MDS: When saImmOmInitialize was invoked, MDS sent ‘pid = 0’ in MDS_CALLBACK_RECEIVE_INFO.** **Status:** fixed **Milestone:** 4.7.2 **Created:** Fri May 13, 2016 12:34 PM UTC by Rafael Odzakow **Last Updated:** Mon Jun 06, 2016 03:26 AM UTC **Owner:** A V Mahesh (AVM) This happens only some of the time. First this is called from an application: saImmOmInitialize(immHandle, NULL, ) Then the system log shows this error: May 13 02:31:53 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? May 13 02:31:53 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? May 13 02:31:55 sc2 osafimmnd[6042]: NO Ccb 176 COMMITTED (LDE) May 13 02:31:55 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? May 13 02:31:55 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? --- Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is subscribed to https://sourceforge.net/p/opensaf/tickets/ To unsubscribe from further messages, a project admin can change settings at https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a mailing list, you can unsubscribe from the mailing list.___ Opensaf-tickets mailing list Opensaf-tickets@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/opensaf-tickets
[tickets] [opensaf:tickets] #1825 MDS: When saImmOmInitialize was invoked, MDS sent ‘pid = 0’ in MDS_CALLBACK_RECEIVE_INFO.
- **status**: assigned --> review --- ** [tickets:#1825] MDS: When saImmOmInitialize was invoked, MDS sent ‘pid = 0’ in MDS_CALLBACK_RECEIVE_INFO.** **Status:** review **Milestone:** 4.7.2 **Created:** Fri May 13, 2016 12:34 PM UTC by Rafael **Last Updated:** Thu May 26, 2016 04:46 AM UTC **Owner:** A V Mahesh (AVM) **Attachments:** - [sc2_mds.log](https://sourceforge.net/p/opensaf/tickets/1825/attachment/sc2_mds.log) (2.6 MB; text/x-log) This happens only some of the time. First this is called from an application: saImmOmInitialize(immHandle, NULL, ) Then the system log shows this error: May 13 02:31:53 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? May 13 02:31:53 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? May 13 02:31:55 sc2 osafimmnd[6042]: NO Ccb 176 COMMITTED (LDE) May 13 02:31:55 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? May 13 02:31:55 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? --- Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is subscribed to https://sourceforge.net/p/opensaf/tickets/ To unsubscribe from further messages, a project admin can change settings at https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a mailing list, you can unsubscribe from the mailing list.-- What NetFlow Analyzer can do for you? Monitors network bandwidth and traffic patterns at an interface-level. Reveals which users, apps, and protocols are consuming the most bandwidth. Provides multi-vendor support for NetFlow, J-Flow, sFlow and other flows. Make informed decisions using capacity planning reports. https://ad.doubleclick.net/ddm/clk/305295220;132659582;e___ Opensaf-tickets mailing list Opensaf-tickets@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/opensaf-tickets
[tickets] [opensaf:tickets] #1825 MDS: When saImmOmInitialize was invoked, MDS sent ‘pid = 0’ in MDS_CALLBACK_RECEIVE_INFO.
Here's the result X | PID exist | PID not exist --- | --- | --- OpenSAF as non-root | kill() returns -1errno is EPERM | kill() returns -1errno is ESRCH OpenSAF as root | kill() returns 0 | kill() returns -1errno is ESRCH --- ** [tickets:#1825] MDS: When saImmOmInitialize was invoked, MDS sent ‘pid = 0’ in MDS_CALLBACK_RECEIVE_INFO.** **Status:** assigned **Milestone:** 4.7.2 **Created:** Fri May 13, 2016 12:34 PM UTC by Rafael **Last Updated:** Thu May 26, 2016 03:37 AM UTC **Owner:** A V Mahesh (AVM) **Attachments:** - [sc2_mds.log](https://sourceforge.net/p/opensaf/tickets/1825/attachment/sc2_mds.log) (2.6 MB; text/x-log) This happens only some of the time. First this is called from an application: saImmOmInitialize(immHandle, NULL, ) Then the system log shows this error: May 13 02:31:53 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? May 13 02:31:53 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? May 13 02:31:55 sc2 osafimmnd[6042]: NO Ccb 176 COMMITTED (LDE) May 13 02:31:55 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? May 13 02:31:55 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? --- Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is subscribed to https://sourceforge.net/p/opensaf/tickets/ To unsubscribe from further messages, a project admin can change settings at https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a mailing list, you can unsubscribe from the mailing list.-- Mobile security can be enabling, not merely restricting. Employees who bring their own devices (BYOD) to work are irked by the imposition of MDM restrictions. Mobile Device Manager Plus allows you to control only the apps on BYO-devices by containerizing them, leaving personal data untouched! https://ad.doubleclick.net/ddm/clk/304595813;131938128;j___ Opensaf-tickets mailing list Opensaf-tickets@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/opensaf-tickets
[tickets] [opensaf:tickets] #1825 MDS: When saImmOmInitialize was invoked, MDS sent ‘pid = 0’ in MDS_CALLBACK_RECEIVE_INFO.
Let me understand bit more about the return values in your environment before publishing the final patch. 1) what dose`kill(info->pid, 0)` returning if PUD exist Opensaf as NON-ROOT 2) what dose `kill(info->pid, 0)` returningif PUD NOT exist Opensaf as NON-ROOT 3) what is the `errno` reported if PUD exist Opensaf as NON-ROOT 4) what is the `errno` reported if PUD NOT exist Opensaf as NON-ROOT 5) what dose`kill(info->pid, 0)` returning if PUD exist Opensaf as ROOT 6) what dose `kill(info->pid, 0)` returningif PUD NOT exist Opensaf as ROOT 7) what is the `errno` reported if PUD exist Opensaf as ROOT 8) what is the `errno` reported if PUD NOT exist Opensaf as ROOT -AVM --- ** [tickets:#1825] MDS: When saImmOmInitialize was invoked, MDS sent ‘pid = 0’ in MDS_CALLBACK_RECEIVE_INFO.** **Status:** assigned **Milestone:** 4.7.2 **Created:** Fri May 13, 2016 12:34 PM UTC by Rafael **Last Updated:** Thu May 26, 2016 03:20 AM UTC **Owner:** A V Mahesh (AVM) **Attachments:** - [sc2_mds.log](https://sourceforge.net/p/opensaf/tickets/1825/attachment/sc2_mds.log) (2.6 MB; text/x-log) This happens only some of the time. First this is called from an application: saImmOmInitialize(immHandle, NULL, ) Then the system log shows this error: May 13 02:31:53 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? May 13 02:31:53 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? May 13 02:31:55 sc2 osafimmnd[6042]: NO Ccb 176 COMMITTED (LDE) May 13 02:31:55 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? May 13 02:31:55 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? --- Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is subscribed to https://sourceforge.net/p/opensaf/tickets/ To unsubscribe from further messages, a project admin can change settings at https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a mailing list, you can unsubscribe from the mailing list.-- Mobile security can be enabling, not merely restricting. Employees who bring their own devices (BYOD) to work are irked by the imposition of MDM restrictions. Mobile Device Manager Plus allows you to control only the apps on BYO-devices by containerizing them, leaving personal data untouched! https://ad.doubleclick.net/ddm/clk/304595813;131938128;j___ Opensaf-tickets mailing list Opensaf-tickets@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/opensaf-tickets
[tickets] [opensaf:tickets] #1825 MDS: When saImmOmInitialize was invoked, MDS sent ‘pid = 0’ in MDS_CALLBACK_RECEIVE_INFO.
Whic one works : if ((info != NULL) && (kill(info->pid, 0) == -1) && (errno != EPERM)) or if ((info != NULL) && (kill(info->pid, 0) == -1) && (errno == EPERM)) -AVM --- ** [tickets:#1825] MDS: When saImmOmInitialize was invoked, MDS sent ‘pid = 0’ in MDS_CALLBACK_RECEIVE_INFO.** **Status:** assigned **Milestone:** 4.7.2 **Created:** Fri May 13, 2016 12:34 PM UTC by Rafael **Last Updated:** Tue May 24, 2016 09:54 AM UTC **Owner:** A V Mahesh (AVM) **Attachments:** - [sc2_mds.log](https://sourceforge.net/p/opensaf/tickets/1825/attachment/sc2_mds.log) (2.6 MB; text/x-log) This happens only some of the time. First this is called from an application: saImmOmInitialize(immHandle, NULL, ) Then the system log shows this error: May 13 02:31:53 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? May 13 02:31:53 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? May 13 02:31:55 sc2 osafimmnd[6042]: NO Ccb 176 COMMITTED (LDE) May 13 02:31:55 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? May 13 02:31:55 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? --- Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is subscribed to https://sourceforge.net/p/opensaf/tickets/ To unsubscribe from further messages, a project admin can change settings at https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a mailing list, you can unsubscribe from the mailing list.-- Mobile security can be enabling, not merely restricting. Employees who bring their own devices (BYOD) to work are irked by the imposition of MDM restrictions. Mobile Device Manager Plus allows you to control only the apps on BYO-devices by containerizing them, leaving personal data untouched! https://ad.doubleclick.net/ddm/clk/304595813;131938128;j___ Opensaf-tickets mailing list Opensaf-tickets@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/opensaf-tickets
[tickets] [opensaf:tickets] #1825 MDS: When saImmOmInitialize was invoked, MDS sent ‘pid = 0’ in MDS_CALLBACK_RECEIVE_INFO.
Then give a try with `if ((info != NULL) && ((kill(info->pid, 0) == -1) || errno == EPERM)` in osaf/libs/core/mds/mds_dt_common.c and let me know the result By the way with Linux flavor you are using ? Are you running opensaf as NON root user ? --- ** [tickets:#1825] MDS: When saImmOmInitialize was invoked, MDS sent ‘pid = 0’ in MDS_CALLBACK_RECEIVE_INFO.** **Status:** assigned **Milestone:** 4.7.2 **Created:** Fri May 13, 2016 12:34 PM UTC by Rafael **Last Updated:** Tue May 24, 2016 07:45 AM UTC **Owner:** A V Mahesh (AVM) **Attachments:** - [sc2_mds.log](https://sourceforge.net/p/opensaf/tickets/1825/attachment/sc2_mds.log) (2.6 MB; text/x-log) This happens only some of the time. First this is called from an application: saImmOmInitialize(immHandle, NULL, ) Then the system log shows this error: May 13 02:31:53 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? May 13 02:31:53 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? May 13 02:31:55 sc2 osafimmnd[6042]: NO Ccb 176 COMMITTED (LDE) May 13 02:31:55 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? May 13 02:31:55 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? --- Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is subscribed to https://sourceforge.net/p/opensaf/tickets/ To unsubscribe from further messages, a project admin can change settings at https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a mailing list, you can unsubscribe from the mailing list.-- Mobile security can be enabling, not merely restricting. Employees who bring their own devices (BYOD) to work are irked by the imposition of MDM restrictions. Mobile Device Manager Plus allows you to control only the apps on BYO-devices by containerizing them, leaving personal data untouched! https://ad.doubleclick.net/ddm/clk/304595813;131938128;j___ Opensaf-tickets mailing list Opensaf-tickets@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/opensaf-tickets
[tickets] [opensaf:tickets] #1825 MDS: When saImmOmInitialize was invoked, MDS sent ‘pid = 0’ in MDS_CALLBACK_RECEIVE_INFO.
- **status**: unassigned --> assigned - **assigned_to**: A V Mahesh (AVM) - **Comment**: > Hung Nguyen wrote: > > But sometimes we receive in order like this: > Receive IMMND_EVT_A2ND_IMM_FINALIZE > MDS_REGISTER_REQ messages > MDS_UNREGISTER_REQ messages > Receive IMMND_EVT_A2ND_IMM_INIT > NCSMDS_DOWN event (TIPC_WITHDRAWN) > NCSMDS_UP event (TIPC_PUBLISHED) > > When receiving MDS_UNREGISTER_REQ messages, the process info is removed from > >process_info_db. Then MDS_REGISTER_REQ comes right after that and the > process info >is added to process_info_db. Then when the TIPC_WITHDRAWN event > comes, >mds_mcm_svc_down() will check and see that process info is still > there, timer is started. >After 10 seconds, the process info is cleared. So In your case auth-server thread MDS_UNREGISTER_REQ messages got delayed , so process info is NOT cleared at the time of mds_mcm_svc_down() , so MDS_DOWN_TMR is started. and your IMMA client re-registers immediately after unregistered , because of that MDS_REGISTER_REQ messages of new auth-server thread landed earlier than MDS_UNREGISTER_REQ of previous auth-server thread, is causing the problem. But even mds_mcm_svc_down() comes late and timer is started and expires After 10 seconds, based on the `case MDS_DOWN_TMR` code , process info is NOT cleared because of `if ((info != NULL) && (kill(info->pid, 0) == -1))` condition, which delete process_info , if process not exist to avoid race with a client that re-registers immediately after unregistered. osaf/libs/core/mds/mds_dt_common.c === case MDS_DOWN_TMR: { MDS_PROCESS_INFO *info = mds_process_info_get( tmr_req_info->info.down_event_tmr_info.adest, tmr_req_info->info.down_event_tmr_info.svc_id); /* only delete if process not exist to avoid race with a client * that re-registers immediately after unregister */ if ((info != NULL) && (kill(info->pid, 0) == -1)) { TRACE("TIMEOUT, deleting entry for %"PRIx64", pid:%d", info->mds_dest, info->pid); (void)mds_process_info_del(info); free(info); } if (tmr_req_info->info.down_event_tmr_info.tmr_id != NULL) { ncs_tmr_free(tmr_req_info->info.down_event_tmr_info.tmr_id); } break; } === > Hung Nguyen wrote: > I think the process info should not be cleared in this case (by clearing the > timer when >receving TIPC_PUBLISHED event). This could be a solution , but fist we need to check why ` (kill(info->pid, 0) == -1)` is failing in your environment , alternatively we can also see the auth-server thread join and priarity , ect ... --- ** [tickets:#1825] MDS: When saImmOmInitialize was invoked, MDS sent ‘pid = 0’ in MDS_CALLBACK_RECEIVE_INFO.** **Status:** assigned **Milestone:** 4.7.2 **Created:** Fri May 13, 2016 12:34 PM UTC by Rafael **Last Updated:** Mon May 23, 2016 03:58 AM UTC **Owner:** A V Mahesh (AVM) **Attachments:** - [sc2_mds.log](https://sourceforge.net/p/opensaf/tickets/1825/attachment/sc2_mds.log) (2.6 MB; text/x-log) This happens only some of the time. First this is called from an application: saImmOmInitialize(immHandle, NULL, ) Then the system log shows this error: May 13 02:31:53 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? May 13 02:31:53 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? May 13 02:31:55 sc2 osafimmnd[6042]: NO Ccb 176 COMMITTED (LDE) May 13 02:31:55 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? May 13 02:31:55 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? --- Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is subscribed to https://sourceforge.net/p/opensaf/tickets/ To unsubscribe from further messages, a project admin can change settings at https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a mailing list, you can unsubscribe from the mailing list.-- Mobile security can be enabling, not merely restricting. Employees who bring their own devices (BYOD) to work are irked by the imposition of MDM restrictions.
[tickets] [opensaf:tickets] #1825 MDS: When saImmOmInitialize was invoked, MDS sent ‘pid = 0’ in MDS_CALLBACK_RECEIVE_INFO.
>>The only thing we can make sure that will come in order is NCSMDS_DOWN and >>NCSMDS_UP. >>So I still think that MDS should clear the down timer when receiving >>NCSMDS_UP. You mean NCSMDS_DOWN and NCSMDS_UP MDS events are kept on separate sockets (Dsock and BSRsock) ? or NCSMDS_DOWN/NCSMDS_UP MDS events and "withdrawn event timer" events are kept on separate sockets (Dsock and BSRsock) ? I understand you observations as follows : In your setup, some times you are receiving the NCSMDS_UP for IMMA earlier than the previous "withdrawn event timer" expire, which was started to remove the NCSMDS_DOWN for the same IMMA info as new IMMA , which is being kept on different socket is getting processed after the new NCSMDS_UP for IMMA ( same adest+svc_id ) and removing the PID information of the new same adest+svc_id. is that right ? --- ** [tickets:#1825] MDS: When saImmOmInitialize was invoked, MDS sent ‘pid = 0’ in MDS_CALLBACK_RECEIVE_INFO.** **Status:** unassigned **Milestone:** 4.7.2 **Created:** Fri May 13, 2016 12:34 PM UTC by Rafael **Last Updated:** Thu May 19, 2016 10:46 AM UTC **Owner:** nobody **Attachments:** - [sc2_mds.log](https://sourceforge.net/p/opensaf/tickets/1825/attachment/sc2_mds.log) (2.6 MB; text/x-log) This happens only some of the time. First this is called from an application: saImmOmInitialize(immHandle, NULL, ) Then the system log shows this error: May 13 02:31:53 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? May 13 02:31:53 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? May 13 02:31:55 sc2 osafimmnd[6042]: NO Ccb 176 COMMITTED (LDE) May 13 02:31:55 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? May 13 02:31:55 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? --- Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is subscribed to https://sourceforge.net/p/opensaf/tickets/ To unsubscribe from further messages, a project admin can change settings at https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a mailing list, you can unsubscribe from the mailing list.-- Mobile security can be enabling, not merely restricting. Employees who bring their own devices (BYOD) to work are irked by the imposition of MDM restrictions. Mobile Device Manager Plus allows you to control only the apps on BYO-devices by containerizing them, leaving personal data untouched! https://ad.doubleclick.net/ddm/clk/304595813;131938128;j___ Opensaf-tickets mailing list Opensaf-tickets@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/opensaf-tickets
[tickets] [opensaf:tickets] #1825 MDS: When saImmOmInitialize was invoked, MDS sent ‘pid = 0’ in MDS_CALLBACK_RECEIVE_INFO.
Update: I checked the log again and saw that the IMMA down event (NCSMDS_DOWN) is not unexpected. It just came late. ~~~ 11:26:21.775374 [immsv_evt.c:5473] T8 Received: IMMND_EVT_A2ND_IMM_FINALIZE (5) from 2010f 11:26:21.775529 [mds_main.c:0153] TR MDS: received 79 from 2010ffc93cf53, pid 5725 11:26:21.776493 [mds_main.c:0153] TR MDS: received 77 from 2010ffc93cf53, pid 5725 11:26:21.776703 [immsv_evt.c:5473] T8 Received: IMMND_EVT_A2ND_IMM_INIT (3) from 2010f 11:26:21.779147 [immnd_evt.c:10368] T2 IMMA DOWN EVENT 11:26:21.779237 [immnd_evt.c:10443] T2 IMMA UP EVENT ~~~ The normal order would be ~~~ 17:29:00.897880 [immsv_evt.c:5473] T8 Received: IMMND_EVT_A2ND_IMM_FINALIZE (5) from 2010f 17:29:00.898182 [mds_main.c:0153] TR MDS: received 79 from 2010fd6dc3d6d, pid 685 17:29:00.898526 [immnd_evt.c:10386] T2 IMMA DOWN EVENT 17:29:00.899954 [immnd_evt.c:10461] T2 IMMA UP EVENT 17:29:00.900299 [mds_main.c:0153] TR MDS: received 77 from 2010f28dd3e79, pid 685 17:29:00.900532 [immsv_evt.c:5473] T8 Received: IMMND_EVT_A2ND_IMM_INIT (3) from 2010f ~~~ So the problem is not tipc connection flickering. It's just that the event came late. We can't control the order since MDS events and MDS messages come from seperate sockets (Dsock and BSRsock) and the auth server is an seperate thread. The only thing we can make sure that will come in order is NCSMDS_DOWN and NCSMDS_UP. So I still think that MDS should clear the down timer when receiving NCSMDS_UP. --- ** [tickets:#1825] MDS: When saImmOmInitialize was invoked, MDS sent ‘pid = 0’ in MDS_CALLBACK_RECEIVE_INFO.** **Status:** unassigned **Milestone:** 4.7.2 **Created:** Fri May 13, 2016 12:34 PM UTC by Rafael **Last Updated:** Wed May 18, 2016 09:07 AM UTC **Owner:** nobody **Attachments:** - [sc2_mds.log](https://sourceforge.net/p/opensaf/tickets/1825/attachment/sc2_mds.log) (2.6 MB; text/x-log) This happens only some of the time. First this is called from an application: saImmOmInitialize(immHandle, NULL, ) Then the system log shows this error: May 13 02:31:53 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? May 13 02:31:53 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? May 13 02:31:55 sc2 osafimmnd[6042]: NO Ccb 176 COMMITTED (LDE) May 13 02:31:55 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? May 13 02:31:55 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? --- Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is subscribed to https://sourceforge.net/p/opensaf/tickets/ To unsubscribe from further messages, a project admin can change settings at https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a mailing list, you can unsubscribe from the mailing list.-- Mobile security can be enabling, not merely restricting. Employees who bring their own devices (BYOD) to work are irked by the imposition of MDM restrictions. Mobile Device Manager Plus allows you to control only the apps on BYO-devices by containerizing them, leaving personal data untouched! https://ad.doubleclick.net/ddm/clk/304595813;131938128;j___ Opensaf-tickets mailing list Opensaf-tickets@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/opensaf-tickets
[tickets] [opensaf:tickets] #1825 MDS: When saImmOmInitialize was invoked, MDS sent ‘pid = 0’ in MDS_CALLBACK_RECEIVE_INFO.
Hi Mahesh, Here's the immnd trace and mds log for this problem. ~~~ 11:26:21.778367 osafimmnd[5238] NOTIFY |MDTM: svc down event for svc_id = IMMA_OM(26), subscri. by svc_id = IMMND(25) pwe_id=1 Adest =11:26:21.778474 osafimmnd[5238] NOTIFY |MDTM: svc up event for svc_id = IMMA_OM(26), subscri. by svc_id = IMMND(25) pwe_id=1 Adest = ~~~ ~~~ 11:26:21.779135 osafimmnd [5238:immsv_evt.c:5473] T8 Received: IMMND_EVT_MDS_INFO (1) from 0 11:26:21.779147 osafimmnd [5238:immnd_evt.c:10368] T2 IMMA DOWN EVENT 11:26:21.779225 osafimmnd [5238:immsv_evt.c:5473] T8 Received: IMMND_EVT_MDS_INFO (1) from 0 11:26:21.779237 osafimmnd [5238:immnd_evt.c:10443] T2 IMMA UP EVENT ~~~ IMM received an NCSMDS_DOWN event and an NCSMDS_UP event right after that. There's no message from client, so the client didn't call ImmOmFinalize()/ImmOmInitialize(). It seemed that the connection is not stable/flickering or something like that (?) Dsock is a tipc socket, maybe this is a problem with tipc (?) When there's a "withdrawn event", MDS starts a timer to remove the process info after 10 seconds. That's the reason why IMM can't read the PID. ~~~ 11:26:31.878622 osafimmnd [5238:mds_dt_common.c:0862] TR TIMEOUT, deleting entry for 2010ffc93cf53, pid:5725 11:26:31.878655 osafimmnd [5238:mds_c_db.c:2527] >> mds_process_info_del: dest:2010ffc93cf53, pid:5725, svc:26 ~~~ The question is: should/does MDS clear the timer when it receives "published event" from the same adest+svc_id ? The problem can be solved if the timer is cleared. Attachments: - [logs.tgz](https://sourceforge.net/p/opensaf/tickets/_discuss/thread/62c4df9d/b855/attachment/logs.tgz) (5.5 MB; application/x-compressed) --- ** [tickets:#1825] MDS: When saImmOmInitialize was invoked, MDS sent ‘pid = 0’ in MDS_CALLBACK_RECEIVE_INFO.** **Status:** unassigned **Milestone:** 4.7.2 **Created:** Fri May 13, 2016 12:34 PM UTC by Rafael **Last Updated:** Tue May 17, 2016 11:44 AM UTC **Owner:** nobody **Attachments:** - [sc2_mds.log](https://sourceforge.net/p/opensaf/tickets/1825/attachment/sc2_mds.log) (2.6 MB; text/x-log) This happens only some of the time. First this is called from an application: saImmOmInitialize(immHandle, NULL, ) Then the system log shows this error: May 13 02:31:53 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? May 13 02:31:53 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? May 13 02:31:55 sc2 osafimmnd[6042]: NO Ccb 176 COMMITTED (LDE) May 13 02:31:55 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? May 13 02:31:55 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? --- Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is subscribed to https://sourceforge.net/p/opensaf/tickets/ To unsubscribe from further messages, a project admin can change settings at https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a mailing list, you can unsubscribe from the mailing list.-- Mobile security can be enabling, not merely restricting. Employees who bring their own devices (BYOD) to work are irked by the imposition of MDM restrictions. Mobile Device Manager Plus allows you to control only the apps on BYO-devices by containerizing them, leaving personal data untouched! https://ad.doubleclick.net/ddm/clk/304595813;131938128;j___ Opensaf-tickets mailing list Opensaf-tickets@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/opensaf-tickets
[tickets] [opensaf:tickets] #1825 MDS: When saImmOmInitialize was invoked, MDS sent ‘pid = 0’ in MDS_CALLBACK_RECEIVE_INFO.
The test-case (simplified); 1. Start a virtual cluster 2. Installs the SW-under-test with a "campaign" with cluster reboot ~02:19, SAF started ~02:20:09 3. Performs some tests (most likely not related) 4. Starts a "scale-out" by starting 2 new VM's in the cluster ~02:30 5. The new VM's boots using a "proble-kernel" 6. SAF is made aware of the new VM's and are supposed to include them in the SAF cluster ~02:31 I think the logs at ~2:20 is turbulence from the install and SAF start. The fault is intermittent and occurs around 1 out of 20 nightly tests so we can't give an instruction how to reproduce it. It does however re-occurs so if we can look for something in particular or turn on some extra logging we can do so and provide the information. --- ** [tickets:#1825] MDS: When saImmOmInitialize was invoked, MDS sent ‘pid = 0’ in MDS_CALLBACK_RECEIVE_INFO.** **Status:** unassigned **Milestone:** 4.7.2 **Created:** Fri May 13, 2016 12:34 PM UTC by Rafael **Last Updated:** Mon May 16, 2016 03:54 AM UTC **Owner:** nobody **Attachments:** - [sc2_mds.log](https://sourceforge.net/p/opensaf/tickets/1825/attachment/sc2_mds.log) (2.6 MB; text/x-log) This happens only some of the time. First this is called from an application: saImmOmInitialize(immHandle, NULL, ) Then the system log shows this error: May 13 02:31:53 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? May 13 02:31:53 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? May 13 02:31:55 sc2 osafimmnd[6042]: NO Ccb 176 COMMITTED (LDE) May 13 02:31:55 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? May 13 02:31:55 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? --- Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is subscribed to https://sourceforge.net/p/opensaf/tickets/ To unsubscribe from further messages, a project admin can change settings at https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a mailing list, you can unsubscribe from the mailing list.-- Mobile security can be enabling, not merely restricting. Employees who bring their own devices (BYOD) to work are irked by the imposition of MDM restrictions. Mobile Device Manager Plus allows you to control only the apps on BYO-devices by containerizing them, leaving personal data untouched! https://ad.doubleclick.net/ddm/clk/304595813;131938128;j___ Opensaf-tickets mailing list Opensaf-tickets@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/opensaf-tickets
[tickets] [opensaf:tickets] #1825 MDS: When saImmOmInitialize was invoked, MDS sent ‘pid = 0’ in MDS_CALLBACK_RECEIVE_INFO.
checked MDS logs : IMMND PID [6042] started at time stamp of `May 13 2:20:11.902347` May 13 2:20:11.902347 osafimmnd[6042] NOTIFY |BEGIN MDS LOGGING| PID=| ARCHW=9|64bit=1 From the time stamp around `~May 13 2:20:18.821166` a continuous UP and DOWN event of IMMA_OM are received by osafimmnd[6042] (IMMA_OM agent initialized and uninitialized ) May 13 2:20:18.821166 osafimmnd[6042] NOTIFY |MDTM: svc up event for svc_id = IMMA_OM(26), subscri. by svc_id = IMMND(25) pwe_id=1 Adest = May 13 2:20:18.822243 osafimmnd[6042] NOTIFY |MDTM: svc down event for svc_id = IMMA_OM(26), subscri. by svc_id = IMMND(25) pwe_id=1 Adest = Till the timestamps of `May 13 2:20:29.78127`the smiler activity happens for osafntfd[6074] NTFA (NTFA agent initialized and uninitialized ) and then osafimmnd[6042] received the error tat this bug reported May 13 02:31:53 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? May 13 02:31:53 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? Is the test case is running multiple opensaf agent (IMMA_OM/NTFA/ect..) initialized and uninitialized in a continuous loop ? or can you please provide steps to reproduce . --- ** [tickets:#1825] MDS: When saImmOmInitialize was invoked, MDS sent ‘pid = 0’ in MDS_CALLBACK_RECEIVE_INFO.** **Status:** unassigned **Milestone:** 4.7.2 **Created:** Fri May 13, 2016 12:34 PM UTC by Rafael **Last Updated:** Fri May 13, 2016 12:34 PM UTC **Owner:** nobody **Attachments:** - [sc2_mds.log](https://sourceforge.net/p/opensaf/tickets/1825/attachment/sc2_mds.log) (2.6 MB; text/x-log) This happens only some of the time. First this is called from an application: saImmOmInitialize(immHandle, NULL, ) Then the system log shows this error: May 13 02:31:53 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? May 13 02:31:53 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? May 13 02:31:55 sc2 osafimmnd[6042]: NO Ccb 176 COMMITTED (LDE) May 13 02:31:55 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? May 13 02:31:55 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? --- Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is subscribed to https://sourceforge.net/p/opensaf/tickets/ To unsubscribe from further messages, a project admin can change settings at https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a mailing list, you can unsubscribe from the mailing list.-- Mobile security can be enabling, not merely restricting. Employees who bring their own devices (BYOD) to work are irked by the imposition of MDM restrictions. Mobile Device Manager Plus allows you to control only the apps on BYO-devices by containerizing them, leaving personal data untouched! https://ad.doubleclick.net/ddm/clk/304595813;131938128;j___ Opensaf-tickets mailing list Opensaf-tickets@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/opensaf-tickets
[tickets] [opensaf:tickets] #1825 MDS: When saImmOmInitialize was invoked, MDS sent ‘pid = 0’ in MDS_CALLBACK_RECEIVE_INFO.
--- ** [tickets:#1825] MDS: When saImmOmInitialize was invoked, MDS sent ‘pid = 0’ in MDS_CALLBACK_RECEIVE_INFO.** **Status:** unassigned **Milestone:** 4.7.2 **Created:** Fri May 13, 2016 12:34 PM UTC by Rafael **Last Updated:** Fri May 13, 2016 12:34 PM UTC **Owner:** nobody **Attachments:** - [sc2_mds.log](https://sourceforge.net/p/opensaf/tickets/1825/attachment/sc2_mds.log) (2.6 MB; text/x-log) This happens only some of the time. First this is called from an application: saImmOmInitialize(immHandle, NULL, ) Then the system log shows this error: May 13 02:31:53 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? May 13 02:31:53 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? May 13 02:31:55 sc2 osafimmnd[6042]: NO Ccb 176 COMMITTED (LDE) May 13 02:31:55 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? May 13 02:31:55 sc2 osafimmnd[6042]: WA immnd_evt_proc_imm_init: PID 0 (7150) for 2020fb1d13cb8, MDS problem? --- Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is subscribed to https://sourceforge.net/p/opensaf/tickets/ To unsubscribe from further messages, a project admin can change settings at https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a mailing list, you can unsubscribe from the mailing list.-- Mobile security can be enabling, not merely restricting. Employees who bring their own devices (BYOD) to work are irked by the imposition of MDM restrictions. Mobile Device Manager Plus allows you to control only the apps on BYO-devices by containerizing them, leaving personal data untouched! https://ad.doubleclick.net/ddm/clk/304595813;131938128;j___ Opensaf-tickets mailing list Opensaf-tickets@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/opensaf-tickets