The MDS send call(AVA_SEND) returns after sending the message! Subsequently the 
process exit would generate an AVA down. But, i think theoretically the chances 
that an AVA_RESP  and AVA_DOWN could reach out of order is extremely less(the 
only situation i can think of is when the main thread of application has 
starved the MDS thread) *than* when comparing to the chances of an out of order 
of AVA_FINALIZE and AVA_DOWN. 
The AVA_FINALIZE results in a MDS shutdown.

I just checked, in the numerous iterations i tried on a 64-bit setup, the AMF 
response always reached first and the ava down later!

Having said that, if the component timesout on the callback response or returns 
error, AMF proceeds to cleanup the component.

The following are the possible outcomes in this scenario of admin restart 
operation:
(a) IF all goes well, the component gets restarted(re INSTANTIATED)
(b) IF component timesout or returns error then AMF would invoke the cleanup 
script of the component and then component gets restarted(re INSTANTIATED).
(c) IF it can happen that the AVA_DOWN(process exit) message had preceeded the 
AVA_send, AMF would invoke the cleanup script and then component gets restarted.

While the end result is the same for all the cases, the case (c) could still be 
considered safe because the component would have responded to AMF only after 
preparing itself for the exit/kill! Hmm..


---

** [tickets:#501] amf: No node directors register to AMF within time after "#7 
cleanup instead of terminate used at component restart"**

**Status:** unassigned
**Created:** Thu Jul 11, 2013 07:47 AM UTC by Ingvar Bergström
**Last Updated:** Thu Jul 18, 2013 01:31 PM UTC
**Owner:** nobody

After introduction of patches solving "#7 cleanup instead of terminate used at 
component restart”, no node directors registers to AMF within time according to 
messages log.
I have tried SMFND, CPND, GLND and MQND.

It seems however that the main routines of the node director daemons are not 
started until 10 seconds after the terminate callback (after the registration 
timeout).

It is very easy to see the fault by entering command "amf-adm restart 
safComp=xxxND,safSu=SC-1,safSg=NoRed,safApp=OpenSAF"


---

Sent from sourceforge.net because [email protected] is 
subscribed to https://sourceforge.net/p/opensaf/tickets/

To unsubscribe from further messages, a project admin can change settings at 
https://sourceforge.net/p/opensaf/admin/tickets/options.  Or, if this is a 
mailing list, you can unsubscribe from the mailing list.
------------------------------------------------------------------------------
See everything from the browser to the database with AppDynamics
Get end-to-end visibility with application monitoring from AppDynamics
Isolate bottlenecks and diagnose root cause in seconds.
Start your free trial of AppDynamics Pro today!
http://pubads.g.doubleclick.net/gampad/clk?id=48808831&iu=/4140/ostg.clktrk
_______________________________________________
Opensaf-tickets mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/opensaf-tickets

Reply via email to