The AMFND logs on ctrl1 show a lot of faults, and AMFND handled them 
correctly. For this analysis I have considered only the faults that occurred 
once the shutdown operation was actually performed by AMFD, i.e.:
Jul  9 12:40:43.180256 osafamfd [3725:avd_sg.c:1336] >> avd_sg_admin_state_set: 
safSg=SGONE,safApp=TWONAPP AdmState UNLOCKED => SHUTTING_DOWN.

According to the AMFND logs, the fault during the shutdown operation occurred 
not because of a failed operation response in the quiescing callback but due 
to "qscingCompleteTimeout", with recovery componentRestart. Faults due to 
"qscingCompleteTimeout" continued until the escalation level reached 
suFailover. As part of this recovery, AMFND gets quiesced assignments from 
AMFD for SI3, SI4 and SI5 respectively. SI3 should not have received one, 
since it is the sponsor of SI4.

For SI3, AMFND did not issue any callback to COMP2 and COMP3 because they are 
faulted. But it also did not issue a callback to COMP1, even though COMP1 is 
healthy and INSTANTIATED (reassignment after the faults is already in progress 
on it). So no operation done response was generated for SI3. For SI4 and SI5, 
callbacks were issued to COMP1, and when the responses came, AMFND responded 
to AMFD with an operation done event separately for SI4 and SI5.

Since the operation done event was never generated for SI3, the SG is in an 
unstable state and the SUSIs in SU1 remain in the quiesced state. Because of 
this, any subsequent admin operation will be rejected with "WA SG not in 
STABLE state (safSg=SGONE,safApp=TWONAPP)".
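
The bookkeeping described above can be illustrated with a small toy model. 
This is only a sketch of the reasoning, not OpenSAF code: the function name 
and the data structures are hypothetical, and it assumes the SG is treated as 
stable only when every quiesced assignment has produced an operation done 
event.

```python
# Toy model (not OpenSAF code): names and structures are hypothetical.
def unreported_assignments(assigned, op_done):
    """Return the SIs whose quiesced assignment never produced an
    operation done event, in the order they were assigned."""
    return [si for si in assigned if si not in op_done]


# Quiesced assignments sent to SU1 during the suFailover recovery.
assigned = ["SI3", "SI4", "SI5"]
# Operation done events actually reported (SI3 never got a callback).
op_done = {"SI4", "SI5"}

pending = unreported_assignments(assigned, op_done)
# Assumption: the SG counts as stable only when nothing is pending.
sg_stable = not pending
print("pending:", pending, "stable:", sg_stable)
```

With SI3 still pending, the modelled SG never becomes stable, which matches 
the repeated "SG not in STABLE state" rejections.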



---

** [tickets:#492] Assignments are not removed during SG shutdown with faulty 
component**

**Status:** unassigned
**Created:** Tue Jul 09, 2013 07:32 AM UTC by surender khetavath
**Last Updated:** Tue Jul 09, 2013 07:32 AM UTC
**Owner:** Praveen

Changeset : 4325
Model : TWON
Configuration: 1 SG, 5 SUs with 3 components each, 5 SIs with 3 CSIs each.
Initially: 5-node cluster, SU1 mapped to SC-1, SU2 to SC-2, SU3 to PL-3, SU4 and SU5 to PL-4
SU1 was active and SU2 standby
SI-SI dependencies configured as SI1<-SI2<-SI3<-SI4


Test:
Shut down the SG. A component in the active SU is made to reply with 
FAILED_OP in the quiescing callback. The SG goes to the locked state but the 
assignments are not removed. A later unlock of the SG fails with a time-out.

SU states:
safSu=SU2,safSg=SGONE,safApp=TWONAPP
        saAmfSUAdminState=UNLOCKED(1)
        saAmfSUOperState=ENABLED(1)
        saAmfSUPresenceState=INSTANTIATED(3)
        saAmfSUReadinessState=OUT-OF-SERVICE(1)
safSu=SU1,safSg=SGONE,safApp=TWONAPP
        saAmfSUAdminState=UNLOCKED(1)
        saAmfSUOperState=DISABLED(2)
        saAmfSUPresenceState=TERMINATING(4)
        saAmfSUReadinessState=OUT-OF-SERVICE(1)
safSu=SU3,safSg=SGONE,safApp=TWONAPP
        saAmfSUAdminState=UNLOCKED(1)
        saAmfSUOperState=ENABLED(1)
        saAmfSUPresenceState=INSTANTIATED(3)
        saAmfSUReadinessState=OUT-OF-SERVICE(1)
safSu=SU4,safSg=SGONE,safApp=TWONAPP
        saAmfSUAdminState=UNLOCKED(1)
        saAmfSUOperState=ENABLED(1)
        saAmfSUPresenceState=INSTANTIATED(3)
        saAmfSUReadinessState=OUT-OF-SERVICE(1)
safSu=SU5,safSg=SGONE,safApp=TWONAPP
        saAmfSUAdminState=UNLOCKED(1)
        saAmfSUOperState=ENABLED(1)
        saAmfSUPresenceState=UNINSTANTIATED(1)
        saAmfSUReadinessState=OUT-OF-SERVICE(1)

SG state:
safSg=SGONE,safApp=TWONAPP
        saAmfSGAdminState=LOCKED(2)

SUSI states:
safSISU=safSu=SU1\,safSg=SGONE\,safApp=TWONAPP,safSi=TWONSI4,safApp=TWONAPP
        saAmfSISUHAState=QUIESCED(3)
safSISU=safSu=SC-1\,safSg=2N\,safApp=OpenSAF,safSi=SC-2N,safApp=OpenSAF
        saAmfSISUHAState=ACTIVE(1)
safSISU=safSu=SC-1\,safSg=NoRed\,safApp=OpenSAF,safSi=NoRed2,safApp=OpenSAF
        saAmfSISUHAState=ACTIVE(1)
safSISU=safSu=SU1\,safSg=SGONE\,safApp=TWONAPP,safSi=TWONSI2,safApp=TWONAPP
        saAmfSISUHAState=ACTIVE(1)
safSISU=safSu=SU1\,safSg=SGONE\,safApp=TWONAPP,safSi=TWONSI5,safApp=TWONAPP
        saAmfSISUHAState=QUIESCED(3)
safSISU=safSu=SU1\,safSg=SGONE\,safApp=TWONAPP,safSi=TWONSI1,safApp=TWONAPP
        saAmfSISUHAState=ACTIVE(1)
safSISU=safSu=SC-2\,safSg=NoRed\,safApp=OpenSAF,safSi=NoRed1,safApp=OpenSAF
        saAmfSISUHAState=ACTIVE(1)
safSISU=safSu=PL-4\,safSg=NoRed\,safApp=OpenSAF,safSi=NoRed3,safApp=OpenSAF
        saAmfSISUHAState=ACTIVE(1)
safSISU=safSu=SU1\,safSg=SGONE\,safApp=TWONAPP,safSi=TWONSI3,safApp=TWONAPP
        saAmfSISUHAState=QUIESCED(3)
safSISU=safSu=PL-5\,safSg=NoRed\,safApp=OpenSAF,safSi=NoRed5,safApp=OpenSAF
        saAmfSISUHAState=ACTIVE(1)
safSISU=safSu=SC-2\,safSg=2N\,safApp=OpenSAF,safSi=SC-2N,safApp=OpenSAF
        saAmfSISUHAState=STANDBY(2)
safSISU=safSu=PL-3\,safSg=NoRed\,safApp=OpenSAF,safSi=NoRed4,safApp=OpenSAF
        saAmfSISUHAState=ACTIVE(1)
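
To pick the leftover assignments out of a listing like the one above, a small 
parser can help. The following is a minimal sketch that assumes the two-line 
layout shown here (a `safSISU=` DN followed by an indented 
`saAmfSISUHAState=` line); the function name is hypothetical and the sample 
is a subset of the entries above.

```python
import re

# Subset of the SUSI listing above, kept verbatim (raw string keeps "\,").
SAMPLE = r"""
safSISU=safSu=SU1\,safSg=SGONE\,safApp=TWONAPP,safSi=TWONSI4,safApp=TWONAPP
        saAmfSISUHAState=QUIESCED(3)
safSISU=safSu=SC-1\,safSg=2N\,safApp=OpenSAF,safSi=SC-2N,safApp=OpenSAF
        saAmfSISUHAState=ACTIVE(1)
safSISU=safSu=SU1\,safSg=SGONE\,safApp=TWONAPP,safSi=TWONSI2,safApp=TWONAPP
        saAmfSISUHAState=ACTIVE(1)
safSISU=safSu=SU1\,safSg=SGONE\,safApp=TWONAPP,safSi=TWONSI3,safApp=TWONAPP
        saAmfSISUHAState=QUIESCED(3)
"""


def parse_susi_states(text):
    """Return a list of (su, si, ha_state) tuples from a SUSI listing."""
    entries = []
    current_dn = None
    for raw in text.splitlines():
        line = raw.strip()
        if line.startswith("safSISU="):
            current_dn = line
        elif line.startswith("saAmfSISUHAState=") and current_dn:
            # "QUIESCED(3)" -> "QUIESCED"
            state = line.split("=", 1)[1].split("(")[0]
            # SU name ends at the escaped comma; SI name at a plain comma.
            su = re.search(r"safSu=([^\\,]+)", current_dn).group(1)
            si = re.search(r"safSi=([^,]+)", current_dn).group(1)
            entries.append((su, si, state))
            current_dn = None
    return entries


entries = parse_susi_states(SAMPLE)
quiesced_on_su1 = sorted(
    si for su, si, state in entries if su == "SU1" and state == "QUIESCED"
)
print(quiesced_on_su1)
```

Run against the full listing, the same filter would show which SU1 
assignments were left in QUIESCED instead of being removed.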

Syslog on the active controller, i.e. SC-1:
Jul  9 12:43:09 SC-1 osafamfd[3725]: WA SG not in STABLE state 
(safSg=SGONE,safApp=TWONAPP)
Jul  9 12:43:10 SC-1 osafamfd[3725]: WA SG not in STABLE state 
(safSg=SGONE,safApp=TWONAPP)
Jul  9 12:43:11 SC-1 osafamfd[3725]: WA SG not in STABLE state 
(safSg=SGONE,safApp=TWONAPP)
Jul  9 12:43:12 SC-1 osafamfd[3725]: WA SG not in STABLE state 
(safSg=SGONE,safApp=TWONAPP)
Jul  9 12:43:13 SC-1 osafamfd[3725]: WA SG not in STABLE state 
(safSg=SGONE,safApp=TWONAPP)



---

