This issue will be closed with following updated in AMF PR doc.
"Even if configuration supports, AMF will skip suRestart recovery during
recovery escalations in the following cases :
1)When saAmfSUPresenceState of failed SU is SA_AMF_PRESENCE_INSTANTIATING or
SA_AMF_PRESENCE_RESTARTING.
2)When assignments for some or all SIs are pending on the SU."
This behavior seems justified because if escalation has reached to surestart
level, it means some or the other component is failing in the SU continuously,
due to which SU is not able to get INSTANTIATED or unable to take assignments
at all. In such a situation, fail-over (component/SU/node fail-over) should be
done as soon as possible.
Comments?
---
** [tickets:#294] Error escalation for a component is going from COMPONENT
RESTART to SU FAILOVER without SU RESTART during si-swap operation**
**Status:** assigned
**Created:** Wed May 22, 2013 11:24 AM UTC by Nagendra Kumar
**Last Updated:** Fri Oct 04, 2013 12:19 PM UTC
**Owner:** Praveen
Migrated from http://devel.opensaf.org/ticket/2557
Changeset: 3406
Configuration:
2N configuration with 2 SUs, one on PL-4 and one on PL-5.
SG contains 7 SIs
Each SU is having 5 Components.
Component 3 of SU1 is faulty and rejects the Active assignment and its recovery
policy is COMP RESTART
SU1 is brought up as Standby and SU2 is brought up as Active
When SI-SWAP admin operation is triggered on SI3, COMP3 goes for recovery as it
does not accept the Active assignment
The recovery goes from COMPONENT RESTART to SU FAILOVER without SU RESTART
The following is the syslog output of PL-4:
amf-adm si-swap safSi=SI3,safApp=test2nApp
Mar 6 16:32:32 SLES11-SLOT-1 osafamfnd[2848]: Assigning
'safSi=SI1,safApp=test2nApp' ACTIVE to 'safSu=SU1,safSg=SG,safApp=test2nApp'
Mar 6 16:32:42 SLES11-SLOT-1 osafamfnd[2848]:
'safComp=COMP3,safSu=SU1,safSg=SG,safApp=test2nApp' faulted due to
'csiSetcallbackTimeout(10)' : Recovery is 'componentRestart(2)'
Mar 6 16:32:42 SLES11-SLOT-1 logger: CLC-CLI spawnd cleanup for
safComp=COMP3,safSu=SU1,safSg=SG,safApp=test2nApp
Mar 6 16:32:42 SLES11-SLOT-1 logger: CLC-CLI spawnd instantiate for
safComp=COMP3,safSu=SU1,safSg=SG,safApp=test2nApp
Mar 6 16:32:52 SLES11-SLOT-1 osafamfnd[2848]:
'safComp=COMP3,safSu=SU1,safSg=SG,safApp=test2nApp' faulted due to
'csiSetcallbackTimeout(10)' : Recovery is 'componentRestart(2)'
Mar 6 16:32:52 SLES11-SLOT-1 logger: CLC-CLI spawnd cleanup for
safComp=COMP3,safSu=SU1,safSg=SG,safApp=test2nApp
Mar 6 16:32:52 SLES11-SLOT-1 logger: CLC-CLI spawnd instantiate for
safComp=COMP3,safSu=SU1,safSg=SG,safApp=test2nApp
Mar 6 16:33:02 SLES11-SLOT-1 osafamfnd[2848]:
'safComp=COMP3,safSu=SU1,safSg=SG,safApp=test2nApp' faulted due to
'csiSetcallbackTimeout(10)' : Recovery is 'suFailover(11)'
Mar 6 16:33:02 SLES11-SLOT-1 osafamfnd[2848]: Assigned
'safSi=SI1,safApp=test2nApp' ACTIVE to 'safSu=SU1,safSg=SG,safApp=test2nApp'
Mar 6 16:33:02 SLES11-SLOT-1 osafamfnd[2848]:
'safSu=SU1,safSg=SG,safApp=test2nApp' Presence State INSTANTIATED => TERMINATING
Mar 6 16:33:02 SLES11-SLOT-1 logger: CLC-CLI spawnd cleanup for
safComp=COMP3,safSu=SU1,safSg=SG,safApp=test2nApp
Mar 6 16:33:02 SLES11-SLOT-1 osafamfnd[2848]: Removing 'all SIs' from
'safSu=SU1,safSg=SG,safApp=test2nApp'
Mar 6 16:33:02 SLES11-SLOT-1 osafamfnd[2848]: Removed
'safSi=SI1,safApp=test2nApp' from 'safSu=SU1,safSg=SG,safApp=test2nApp'
Mar 6 16:33:02 SLES11-SLOT-1 osafamfnd[2848]: Removed
'safSi=SI2,safApp=test2nApp' from 'safSu=SU1,safSg=SG,safApp=test2nApp'
Mar 6 16:33:02 SLES11-SLOT-1 osafamfnd[2848]: Removed
'safSi=SI3,safApp=test2nApp' from 'safSu=SU1,safSg=SG,safApp=test2nApp'
Mar 6 16:33:02 SLES11-SLOT-1 osafamfnd[2848]: Removed
'safSi=SI4,safApp=test2nApp' from 'safSu=SU1,safSg=SG,safApp=test2nApp'
Mar 6 16:33:02 SLES11-SLOT-1 osafamfnd[2848]: Removed
'safSi=SI5,safApp=test2nApp' from 'safSu=SU1,safSg=SG,safApp=test2nApp'
Mar 6 16:33:02 SLES11-SLOT-1 osafamfnd[2848]: Removed
'safSi=SI6,safApp=test2nApp' from 'safSu=SU1,safSg=SG,safApp=test2nApp'
Mar 6 16:33:02 SLES11-SLOT-1 osafamfnd[2848]: Removed
'safSi=SI7,safApp=test2nApp' from 'safSu=SU1,safSg=SG,safApp=test2nApp'
Mar 6 16:33:02 SLES11-SLOT-1 logger: CLC-CLI spawnd instantiate for
safComp=COMP3,safSu=SU1,safSg=SG,safApp=test2nApp
Mar 6 16:33:02 SLES11-SLOT-1 osafamfnd[2848]:
'safSu=SU1,safSg=SG,safApp=test2nApp' Presence State TERMINATING => INSTANTIATED
Mar 6 16:33:02 SLES11-SLOT-1 osafamfnd[2848]: Assigning
'safSi=SI1,safApp=test2nApp' STANDBY to 'safSu=SU1,safSg=SG,safApp=test2nApp'
Mar 6 16:33:02 SLES11-SLOT-1 osafamfnd[2848]: Assigning
'safSi=SI2,safApp=test2nApp' STANDBY to 'safSu=SU1,safSg=SG,safApp=test2nApp'
Mar 6 16:33:03 SLES11-SLOT-1 osafamfnd[2848]: Assigning
'safSi=SI3,safApp=test2nApp' STANDBY to 'safSu=SU1,safSg=SG,safApp=test2nApp'
Mar 6 16:33:03 SLES11-SLOT-1 osafamfnd[2848]: Assigning
'safSi=SI4,safApp=test2nApp' STANDBY to 'safSu=SU1,safSg=SG,safApp=test2nApp'
Mar 6 16:33:03 SLES11-SLOT-1 osafamfnd[2848]: Assigning
'safSi=SI5,safApp=test2nApp' STANDBY to 'safSu=SU1,safSg=SG,safApp=test2nApp'
Mar 6 16:33:03 SLES11-SLOT-1 osafamfnd[2848]: Assigning
'safSi=SI6,safApp=test2nApp' STANDBY to 'safSu=SU1,safSg=SG,safApp=test2nApp'
Mar 6 16:33:03 SLES11-SLOT-1 osafamfnd[2848]: Assigning
'safSi=SI7,safApp=test2nApp' STANDBY to 'safSu=SU1,safSg=SG,safApp=test2nApp'
Mar 6 16:33:03 SLES11-SLOT-1 osafamfnd[2848]: Assigned
'safSi=SI1,safApp=test2nApp' STANDBY to 'safSu=SU1,safSg=SG,safApp=test2nApp'
Mar 6 16:33:03 SLES11-SLOT-1 osafamfnd[2848]: Assigned
'safSi=SI2,safApp=test2nApp' STANDBY to 'safSu=SU1,safSg=SG,safApp=test2nApp'
Mar 6 16:33:03 SLES11-SLOT-1 osafamfnd[2848]: Assigned
'safSi=SI3,safApp=test2nApp' STANDBY to 'safSu=SU1,safSg=SG,safApp=test2nApp'
Mar 6 16:33:03 SLES11-SLOT-1 osafamfnd[2848]: Assigned
'safSi=SI4,safApp=test2nApp' STANDBY to 'safSu=SU1,safSg=SG,safApp=test2nApp'
Mar 6 16:33:03 SLES11-SLOT-1 osafamfnd[2848]: Assigned
'safSi=SI5,safApp=test2nApp' STANDBY to 'safSu=SU1,safSg=SG,safApp=test2nApp'
Mar 6 16:33:03 SLES11-SLOT-1 osafamfnd[2848]: Assigned
'safSi=SI6,safApp=test2nApp' STANDBY to 'safSu=SU1,safSg=SG,safApp=test2nApp'
Mar 6 16:33:03 SLES11-SLOT-1 osafamfnd[2848]: Assigned
'safSi=SI7,safApp=test2nApp' STANDBY to 'safSu=SU1,safSg=SG,safApp=test2nApp'
---
Sent from sourceforge.net because [email protected] is
subscribed to https://sourceforge.net/p/opensaf/tickets/
To unsubscribe from further messages, a project admin can change settings at
https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a
mailing list, you can unsubscribe from the mailing list.
------------------------------------------------------------------------------
October Webinars: Code for Performance
Free Intel webinars can help you accelerate application performance.
Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most from
the latest Intel processors and coprocessors. See abstracts and register >
http://pubads.g.doubleclick.net/gampad/clk?id=60134791&iu=/4140/ostg.clktrk
_______________________________________________
Opensaf-tickets mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/opensaf-tickets