---
** [tickets:#2365] AMF: Active controller went for continuous reboots when an
NPM app is upgraded with more SIs and CSIs**
**Status:** unassigned
**Milestone:** 5.2.RC1
**Created:** Sat Mar 11, 2017 03:40 PM UTC by Chani Srivastava
**Last Updated:** Sat Mar 11, 2017 03:40 PM UTC
**Owner:** nobody
**Environment details**
OS : Suse 64bit
Changeset : 8634 ( 5.2.FC)
Setup : 4 nodes ( 2 controllers and 2 payloads / no PBE )
**Steps followed & Observed behaviour**
1. Import attached xml
2. Bring up the attached NPM.sh application
3. Execute attached campaign22.xml to upgrade the application
Campaign22.xml adds more SIs and CSIs (i.e. more work) and assigns them both to
SUs that can handle more work and to spare SUs.
Oct 2 18:03:43 OSAF-SC1 osafamfnd[6292]: NO Restarting a component of
'safSu=SU1,safSg=SGONE,safApp=NPMAPP' (comp restart count: 1)
Oct 2 18:03:43 OSAF-SC1 osafamfnd[6292]: NO
'safComp=COMP3SU1NPMAPP,safSu=SU1,safSg=SGONE,safApp=NPMAPP' faulted due to
'avaDown' : Recovery is 'componentRestart'
Oct 2 18:03:43 OSAF-SC1 osafamfnd[6292]: NO Restarting a component of
'safSu=SU1,safSg=SGONE,safApp=NPMAPP' (comp restart count: 2)
Oct 2 18:03:43 OSAF-SC1 osafamfnd[6292]: NO
'safComp=COMP2SU1NPMAPP,safSu=SU1,safSg=SGONE,safApp=NPMAPP' faulted due to
'avaDown' : Recovery is 'componentRestart'
|
Oct 2 18:03:43 OSAF-SC1 osafamfnd[6292]: NO Performing failover of
'safSu=SU1,safSg=SGONE,safApp=NPMAPP' (SU failover count: 1)
Oct 2 18:03:43 OSAF-SC1 osafamfnd[6292]: NO
'safComp=COMP1SU1NPMAPP,safSu=SU1,safSg=SGONE,safApp=NPMAPP' recovery action
escalated from 'componentRestart' to 'suFailover'
|
Oct 2 18:03:47 OSAF-SC1 osafamfnd[6292]: NO
'safComp=COMP3SU1NPMAPP,safSu=SU1,safSg=SGONE,safApp=NPMAPP' recovery action
escalated from 'componentRestart' to 'nodeFailover'
Oct 2 18:03:47 OSAF-SC1 osafamfnd[6292]: NO
'safComp=COMP3SU1NPMAPP,safSu=SU1,safSg=SGONE,safApp=NPMAPP' faulted due to
'avaDown' : Recovery is 'nodeFailover'
|
Oct 2 18:03:49 OSAF-SC1 osafamfnd[6292]: NO Received reboot order, ordering
reboot now!
Oct 2 18:03:49 OSAF-SC1 osafamfnd[6292]: Rebooting OpenSAF NodeId = 131343 EE
Name = , Reason: Received reboot order, OwnNodeId = 131343, SupervisionTime = 60
Oct 2 18:03:49 OSAF-SC1 opensaf_reboot: Rebooting local node; timeout=60
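
For anyone triaging the full logs, the escalation chain in the excerpt above (componentRestart, then suFailover, then nodeFailover, then reboot) can be pulled out of a large syslog with a small script. The following is only a rough sketch: it assumes the two osafamfnd message shapes quoted in this ticket, and the names `summarize`, `short_name` and `SAMPLE` are hypothetical helpers, not part of OpenSAF.

```python
import re

# Patterns for the two osafamfnd message shapes quoted in this ticket.
# Assumption: wrapped lines have been rejoined with single spaces first.
ESCALATED = re.compile(
    r"'(?P<dn>safComp=[^']+)' recovery action escalated "
    r"from '(?P<old>\w+)' to '(?P<new>\w+)'"
)
FAULTED = re.compile(
    r"'(?P<dn>safComp=[^']+)' faulted due to "
    r"'(?P<cause>\w+)' : Recovery is '(?P<rec>\w+)'"
)


def short_name(dn):
    """Return the component name from a DN such as 'safComp=X,safSu=...'."""
    return dn.split(",")[0].split("=", 1)[1]


def summarize(log_text):
    """Return (kind, component, detail) tuples in the order they were logged."""
    # The mailer wrapped long syslog lines, so collapse all whitespace.
    flat = " ".join(log_text.split())
    events = []
    for m in FAULTED.finditer(flat):
        detail = f"{m['cause']} -> {m['rec']}"
        events.append((m.start(), "faulted", short_name(m["dn"]), detail))
    for m in ESCALATED.finditer(flat):
        detail = f"{m['old']} -> {m['new']}"
        events.append((m.start(), "escalated", short_name(m["dn"]), detail))
    # Sort by match position to restore log order, then drop the position.
    return [event[1:] for event in sorted(events)]


# A few lines copied from the excerpt in this ticket.
SAMPLE = """\
Oct 2 18:03:43 OSAF-SC1 osafamfnd[6292]: NO
'safComp=COMP1SU1NPMAPP,safSu=SU1,safSg=SGONE,safApp=NPMAPP' recovery action
escalated from 'componentRestart' to 'suFailover'
Oct 2 18:03:47 OSAF-SC1 osafamfnd[6292]: NO
'safComp=COMP3SU1NPMAPP,safSu=SU1,safSg=SGONE,safApp=NPMAPP' recovery action
escalated from 'componentRestart' to 'nodeFailover'
Oct 2 18:03:47 OSAF-SC1 osafamfnd[6292]: NO
'safComp=COMP3SU1NPMAPP,safSu=SU1,safSg=SGONE,safApp=NPMAPP' faulted due to
'avaDown' : Recovery is 'nodeFailover'
"""

if __name__ == "__main__":
    for kind, comp, detail in summarize(SAMPLE):
        print(f"{kind:10} {comp:16} {detail}")
```

Running it over the complete syslog instead of `SAMPLE` should show which component first pushed the recovery past componentRestart.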
I will share the logs and scripts offline as they are huge in size.
---