Analysis:
1) Quiesced assignments were given to SI2-SI4 in SU1.
2) AMFD received quiesced responses for SI2-SI4 and gave quiesced for SI1 in
SU1.
3) AMFD received the quiesced response for SI1 and gave active for SI1 to SU2.
4) AMFD received the active response for SI1 from SU2 and gave active for
SI2-SI4 in SU2.
5) AMFD received the active response for SI2-SI4 from SU2 and sent standby in
SU1.
6) AMFD received a disabled event from AMFND for SU1. AMFD deleted all SUSIs
for SU1 and performed auto repair for SU1.
7) AMFD got an enabled event for SU1 and sent an instantiation message for SU1.
8) AMFD got an enabled event from SU1 once it was instantiated.
Here, in avd_sg_2n_su_insvc_func(), AMFD did not run the new assignment logic
because the SG was unstable.
This handling has been missing for a long time. There is already a ticket for
this missing handling, but with nodefailover as the recovery. Nodefailover is
like SU-failover with the recovery covering the whole node, so from the
assignment perspective a failed SU is handled the same way in both SU-failover
and nodefailover.
As a fix, when AMFD gets a disabled event for SU1, which has a standby
assignment, AMFD should make the SG stable before deleting the SUSIs, in both
the nodefailover and SU-failover cases.
Also, based on the above analysis, old ticket #309 will be updated to cover the
SU-failover case as well, and this ticket will be marked as a duplicate.
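
The sequence and the proposed fix can be illustrated with a toy model. This is
only a sketch under stated assumptions: the class, method, and state names
below are hypothetical and are not AMFD's actual data structures or code. It
shows an SG left unstable when SU1 is disabled while its standby assignment is
outstanding, and how marking the SG stable before deleting the SUSIs lets the
repaired SU1 receive assignments again.

```python
from dataclasses import dataclass, field
from enum import Enum


class SgState(Enum):
    # Hypothetical two-state simplification of the SG state machine.
    STABLE = "stable"
    UNSTABLE = "unstable"  # an operation such as SI swap is still in progress


@dataclass
class ServiceGroup:
    state: SgState = SgState.STABLE
    # SU name -> {SI name -> HA state}; a toy stand-in for SUSI records.
    susis: dict = field(default_factory=dict)

    def su_disabled(self, su: str, apply_fix: bool) -> None:
        """Model of step 6: a disabled event arrives for an SU."""
        assignments = self.susis.get(su, {})
        only_standby = bool(assignments) and all(
            ha == "standby" for ha in assignments.values()
        )
        if apply_fix and only_standby:
            # Proposed fix: make the SG stable before deleting the SUSIs.
            self.state = SgState.STABLE
        self.susis.pop(su, None)

    def su_in_service(self, su: str, sis: list) -> bool:
        """Model of step 8: the repaired SU reports enabled."""
        if self.state is not SgState.STABLE:
            return False  # new assignment logic is skipped
        self.susis[su] = {si: "standby" for si in sis}
        return True


def run(apply_fix: bool) -> ServiceGroup:
    """Replay steps 5-8: standby sent to SU1, SU1 fails, SU1 is repaired."""
    sis = ["SI1", "SI2", "SI3", "SI4"]
    sg = ServiceGroup(
        state=SgState.UNSTABLE,  # swap not finished: standby response pending
        susis={
            "SU1": {si: "standby" for si in sis},
            "SU2": {si: "active" for si in sis},
        },
    )
    sg.su_disabled("SU1", apply_fix)
    sg.su_in_service("SU1", sis)
    return sg
```

Calling run(False) leaves the SG unstable with no assignments in SU1, which is
the condition under which a later si-swap is rejected. Calling run(True)
leaves the SG stable with SU1 holding standby assignments for all four SIs.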
---
**[tickets:#668] si-swap fails after a fault**
**Status:** duplicate
**Created:** Tue Dec 17, 2013 04:56 PM UTC by surender khetavath
**Last Updated:** Mon Dec 30, 2013 08:33 AM UTC
**Owner:** Praveen
changeset : 4733
model : 2n
configuration : 1App,1SG,2SUs with 4comps each, 4SIs with 1 CSI each
si-si deps configured as SI1 sponsor for SI2,3,4 resp
SU1 is active, SU2 is standby.
SU1 is mapped to pl-3 and SU2 to pl-4
saAmfSGAutoRepair=1(True)
SuFailover=1(True)
Test:
1. Perform an SI swap.
2. The sponsor SI times out on the new standby assignment.
3. Perform the SI swap again. This fails.
console output:
amf-adm si-swap safSi=SI1,safApp=test2nApp
error - saImmOmAdminOperationInvoke_2 admin-op RETURNED:
SA_AIS_ERR_BAD_OPERATION (20)
error-string: SI Swap of safSi=SI1,safApp=test2nApp failed
/var/log/messages on SC-1 shows:
Dec 17 22:20:29 SC-1 osafamfd[7249]: ER safSi=SI1,safApp=test2nApp SWAP failed
- SG not stable (1)
/var/log/messages on PL-3:
Dec 17 22:17:19 PL-3 osafamfnd[3430]: ER cannot unlink failed state file
/var/run/opensaf/amf_failed_state: No such file or directory
Dec 17 22:17:19 PL-3 osafamfnd[3430]: NO 'safSu=SU1,safSg=SG,safApp=test2nApp'
Presence State UNINSTANTIATED => INSTANTIATING
Dec 17 22:17:19 PL-3 logger: CLC-CLI spawnd instantiate for
safComp=COMP4,safSu=SU1,safSg=SG,safApp=test2nApp
Dec 17 22:17:19 PL-3 logger: CLC-CLI spawnd instantiate for
safComp=COMP3,safSu=SU1,safSg=SG,safApp=test2nApp
Dec 17 22:17:19 PL-3 logger: CLC-CLI spawnd instantiate for
safComp=COMP2,safSu=SU1,safSg=SG,safApp=test2nApp
Dec 17 22:17:19 PL-3 logger: CLC-CLI spawnd instantiate for
safComp=COMP1,safSu=SU1,safSg=SG,safApp=test2nApp
Dec 17 22:17:19 PL-3 osafamfnd[3430]: NO 'safSu=SU1,safSg=SG,safApp=test2nApp'
Presence State INSTANTIATING => INSTANTIATED