- **Comment**:
Here's a trace snippet from an opensaf start that it is hard to explain...
Sep 8 13:47:55.777790 osafimmnd [5233:mds_c_api.c:1614] TR svc UP process_info
NOTEXIST, svc:26, adest:2020f53b80025
Sep 8 13:47:55.777801 osafimmnd [5233:mds_c_db.c:2352] >>
mds_process_info_add: dest:2020f53b80025, pid:0
Sep 8 13:47:55.777987 osafimmnd [5233:mds_main.c:0151] TR mds: received 77
from 2020f53b80025, pid 5335
Sep 8 13:47:55.778006 osafimmnd [5233:mds_main.c:0167] TR dest 2020f53b80025
already exist
Sep 8 13:47:55.792541 osafimmnd [5233:mds_c_api.c:2675] TR svc 26 DOWN cnt:0,
adest:2020f53b80025
Sep 8 13:47:55.792557 osafimmnd [5233:mds_c_db.c:2361] >>
mds_process_info_del: dest:2020f53b80025, pid:5335
Sep 8 13:47:55.792655 osafimmnd [5233:mds_c_api.c:1614] TR svc UP process_info
NOTEXIST, svc:26, adest:2020f53b80025
Sep 8 13:47:55.792679 osafimmnd [5233:mds_c_db.c:2352] >>
mds_process_info_add: dest:2020f53b80025, pid:0
Sep 8 13:47:55.792701 osafimmnd [5233:mds_main.c:0151] TR mds: received 77
from 2020f53b80025, pid 5335
Sep 8 13:47:55.792945 osafimmnd [5233:mds_main.c:0167] TR dest 2020f53b80025
already exist
Sep 8 13:47:55.811859 osafimmnd [5233:mds_main.c:0151] TR mds: received 77
from 2020f53b80025, pid 5335
Sep 8 13:47:55.811903 osafimmnd [5233:mds_main.c:0167] TR dest 2020f53b80025
already exist
Sep 8 13:47:55.811994 osafimmnd [5233:mds_c_api.c:2675] TR svc 26 DOWN cnt:0,
adest:2020f53b80025
Sep 8 13:47:55.812008 osafimmnd [5233:mds_c_db.c:2361] >>
mds_process_info_del: dest:2020f53b80025, pid:5335
Sep 8 13:47:55.812091 osafimmnd [5233:mds_c_api.c:1614] TR svc UP process_info
NOTEXIST, svc:26, adest:2020f53b80025
Sep 8 13:47:55.812104 osafimmnd [5233:mds_c_db.c:2352] >>
mds_process_info_add: dest:2020f53b80025, pid:0
Sep 8 13:47:55.812194 osafimmnd [5233:immnd_evt.c:0726] WA
immnd_evt_proc_imm_init: PID 0 (5335) for 2020f53b80025, MDS problem?
Sep 8 13:47:55.812742 osafimmnd [5233:mds_c_api.c:2675] TR svc 26 DOWN cnt:0,
adest:2020f53b80025
Sep 8 13:47:55.812760 osafimmnd [5233:mds_c_db.c:2361] >>
mds_process_info_del: dest:2020f53b80025, pid:0
pid:5335 is amfnd
---
** [tickets:#1050] amfnd sometimes fails to start due to ERR_LIBRARY from
saImmOmInitialize**
**Status:** review
**Milestone:** 4.5.0
**Created:** Tue Sep 09, 2014 07:08 AM UTC by Hans Feldt
**Last Updated:** Mon Sep 15, 2014 01:45 PM UTC
**Owner:** Hans Feldt
With MDS/TIPC amfnd randomly fails to start causing failed opensaf start.
osafimmnd logs the infamous "immnd_evt_proc_imm_init: ... MDS problem?"
Reason is a random timing variation of the TIPC topology DOWN event. This
sometimes causes the DOWN event to wrongly delete a newly added process_info
entry.
The trigger for this problem is that some IMM clients in opensaf like amfnd
does not reuse IMM handles but initialize/finalize in a far from optimal way.
This should also be fixed.
The solution under test consists of two parts:
1) The MDS down event just starts a timer in MDS, when the timeout event
happens the process_info entry is deleted.
2) A new explicit disconnect() is added to the MDS API which is used by IMMA
library when it is about to close down the whole core library.
---
Sent from sourceforge.net because [email protected] is
subscribed to https://sourceforge.net/p/opensaf/tickets/
To unsubscribe from further messages, a project admin can change settings at
https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a
mailing list, you can unsubscribe from the mailing list.
------------------------------------------------------------------------------
Want excitement?
Manually upgrade your production database.
When you want reliability, choose Perforce
Perforce version control. Predictably reliable.
http://pubads.g.doubleclick.net/gampad/clk?id=157508191&iu=/4140/ostg.clktrk
_______________________________________________
Opensaf-tickets mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/opensaf-tickets