---
** [tickets:#1500] AMF: Fails to repair failed SU when failed component has
attribute saAmfCompDisableRestart=1**
**Status:** unassigned
**Milestone:** 4.5.2
**Created:** Thu Sep 24, 2015 01:16 PM UTC by Quyen Dao
**Last Updated:** Thu Sep 24, 2015 01:16 PM UTC
**Owner:** nobody
**Attachments:**
-
[AppConfig-2N.xml](https://sourceforge.net/p/opensaf/tickets/1500/attachment/AppConfig-2N.xml)
(9.2 kB; text/xml)
**Steps to reproduce**
* Load the attached model
* Change saAmfCompDisableRestart=1 on the component in SU1
* Change to component CLC-CLI script to return 1 for cleanup
* Trigger component termination failed by killing the component in SU1
* Repair SU1
-> Result: SU1 fails to repair.
**command log**
root@PL-3:~# immcfg -a saAmfCompDisableRestart=1
safComp=AmfDemo,safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1
root@PL-3:~# amf-state su all safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1
safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1
saAmfSUAdminState=UNLOCKED(1)
saAmfSUOperState=ENABLED(1)
saAmfSUPresenceState=INSTANTIATED(3)
saAmfSUReadinessState=IN-SERVICE(2)
root@PL-3:~# pkill amf_demo
root@PL-3:~# amf-state su all safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1
safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1
saAmfSUAdminState=UNLOCKED(1)
saAmfSUOperState=DISABLED(2)
saAmfSUPresenceState=TERMINATION-FAILED(7)
saAmfSUReadinessState=OUT-OF-SERVICE(1)
root@PL-3:~#
root@PL-3:~# amf-adm repaired safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1
root@PL-3:~# amf-state su all safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1
safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1
saAmfSUAdminState=UNLOCKED(1)
saAmfSUOperState=DISABLED(2)
saAmfSUPresenceState=TERMINATION-FAILED(7)
saAmfSUReadinessState=OUT-OF-SERVICE(1)
root@PL-3:~#
**syslog**
Sep 24 20:14:05 PL-3 osafamfnd[417]: NO saAmfCompDisableRestart changed to 1
for 'safComp=AmfDemo,safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1'
Sep 24 20:14:05 PL-3 osafimmnd[389]: NO Ccb 3 COMMITTED (immcfg_PL-3_641)
Sep 24 20:14:05 PL-3 amf_demo[585]: exiting (caught term signal)
Sep 24 20:14:05 PL-3 osafamfnd[417]: NO saAmfCompDisableRestart is true for
'safComp=AmfDemo,safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1'
Sep 24 20:14:05 PL-3 osafamfnd[417]: NO recovery action 'comp restart'
escalated to 'comp failover'
Sep 24 20:14:05 PL-3 osafamfnd[417]: NO saAmfSUFailover is true for
'safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1'
Sep 24 20:14:05 PL-3 osafamfnd[417]: NO SU failover probation timer started
(timeout: 1200000000000 ns)
Sep 24 20:14:05 PL-3 osafamfnd[417]: NO Performing failover of
'safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1' (SU failover count: 1)
Sep 24 20:14:05 PL-3 osafamfnd[417]: NO
'safComp=AmfDemo,safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1' recovery action
escalated from 'componentRestart' to 'suFailover'
Sep 24 20:14:05 PL-3 osafamfnd[417]: NO
'safComp=AmfDemo,safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1' faulted due to
'avaDown' : Recovery is 'suFailover'
Sep 24 20:14:05 PL-3 osafamfnd[417]: NO Terminating components of
'safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1'(abruptly & unordered)
Sep 24 20:14:05 PL-3 osafamfnd[417]: NO
'safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1' Presence State INSTANTIATED =>
TERMINATING
Sep 24 20:14:05 PL-3 osafamfnd[417]: NO Cleanup of
'safComp=AmfDemo,safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1' failed
Sep 24 20:14:05 PL-3 osafamfnd[417]: NO Reason:'Exec of script success, but
script exits with non-zero status'
Sep 24 20:14:05 PL-3 osafamfnd[417]: NO Exit code: 1
Sep 24 20:14:05 PL-3 osafamfnd[417]: WA
'safComp=AmfDemo,safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1' Presence State
TERMINATING => TERMINATION_FAILED
Sep 24 20:14:05 PL-3 osafamfnd[417]: NO
'safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1' Presence State TERMINATING =>
TERMINATION_FAILED
Sep 24 20:14:14 PL-3 osafamfnd[417]: ER ncsmds_api for 0 FAILED,
dest=2030f00000249
Sep 24 20:14:22 PL-3 osafamfnd[417]: NO Repair request for
'safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1'
Sep 24 20:14:22 PL-3 osafamfnd[417]: NO
'safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1' Presence State TERMINATION_FAILED =>
UNINSTANTIATED
Sep 24 20:14:22 PL-3 osafamfnd[417]: NO
'safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1' Presence State UNINSTANTIATED =>
INSTANTIATING
Sep 24 20:14:22 PL-3 amf_demo[734]:
'safComp=AmfDemo,safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1' started
Sep 24 20:14:22 PL-3 osafamfnd[417]: NO
'safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1' Presence State INSTANTIATING =>
INSTANTIATED
Sep 24 20:14:22 PL-3 osafamfnd[417]: NO Assigning
'safSi=AmfDemo,safApp=AmfDemo1' STANDBY to
'safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1'
Sep 24 20:14:22 PL-3 amf_demo[734]: saAmfHealthcheckStart FAILED - 14
Sep 24 20:14:22 PL-3 osafamfnd[417]: NO saAmfCompDisableRestart is true for
'safComp=AmfDemo,safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1'
Sep 24 20:14:22 PL-3 osafamfnd[417]: NO recovery action 'comp restart'
escalated to 'comp failover'
Sep 24 20:14:22 PL-3 osafamfnd[417]: NO saAmfSUFailover is true for
'safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1'
Sep 24 20:14:22 PL-3 osafamfnd[417]: NO Performing failover of
'safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1' (SU failover count: 2)
Sep 24 20:14:22 PL-3 osafamfnd[417]: NO
'safComp=AmfDemo,safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1' recovery action
escalated from 'componentRestart' to 'suFailover'
Sep 24 20:14:22 PL-3 osafamfnd[417]: NO
'safComp=AmfDemo,safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1' faulted due to
'avaDown' : Recovery is 'suFailover'
Sep 24 20:14:22 PL-3 osafamfnd[417]: NO Terminating components of
'safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1'(abruptly & unordered)
Sep 24 20:14:22 PL-3 osafamfnd[417]: NO
'safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1' Presence State INSTANTIATED =>
TERMINATING
Sep 24 20:14:22 PL-3 osafamfnd[417]: NO Cleanup of
'safComp=AmfDemo,safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1' failed
Sep 24 20:14:22 PL-3 osafamfnd[417]: NO Reason:'Exec of script success, but
script exits with non-zero status'
Sep 24 20:14:22 PL-3 osafamfnd[417]: NO Exit code: 1
Sep 24 20:14:22 PL-3 osafamfnd[417]: WA
'safComp=AmfDemo,safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1' Presence State
TERMINATING => TERMINATION_FAILED
Sep 24 20:14:22 PL-3 osafamfnd[417]: NO
'safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1' Presence State TERMINATING =>
TERMINATION_FAILED
---
Sent from sourceforge.net because [email protected] is
subscribed to https://sourceforge.net/p/opensaf/tickets/
To unsubscribe from further messages, a project admin can change settings at
https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a
mailing list, you can unsubscribe from the mailing list.------------------------------------------------------------------------------
Monitor Your Dynamic Infrastructure at Any Scale With Datadog!
Get real-time metrics from all of your servers, apps and tools
in one place.
SourceForge users - Click here to start your Free Trial of Datadog now!
http://pubads.g.doubleclick.net/gampad/clk?id=241902991&iu=/4140
_______________________________________________
Opensaf-tickets mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/opensaf-tickets