- **status**: assigned --> invalid
- **Comment**:
Test app crashed which caused the reboots.
---
** [tickets:#2365] AMF: Active controller went for continuous reboots when an
NPM app is upgraded with more SIs and CSIs**
**Status:** invalid
**Milestone:** 5.2.RC1
**Created:** Sat Mar 11, 2017 03:40 PM UTC by Chani Srivastava
**Last Updated:** Mon Mar 13, 2017 09:19 AM UTC
**Owner:** Praveen
**Environment details**
OS : Suse 64bit
Changeset : 8634 ( 5.2.FC)
Setup : 4 nodes ( 2 controllers and 2 payloads / no PBE )
**Steps followed & Observed behaviour**
1. Import attached xml
2. Bring up the attached NPM.sh application
3. Execute attached campaign22.xml to upgrade the application
Campaign22.xml adds more SIs and CSIs ( i.e work ) and assign it to SUs which
can handle more work and also assign to spare SUs
Oct 2 18:03:43 OSAF-SC1 osafamfnd[6292]: NO Restarting a component of
'safSu=SU1,safSg=SGONE,safApp=NPMAPP' (comp restart count: 1)
Oct 2 18:03:43 OSAF-SC1 osafamfnd[6292]: NO
'safComp=COMP3SU1NPMAPP,safSu=SU1,safSg=SGONE,safApp=NPMAPP' faulted due to
'avaDown' : Recovery is 'componentRestart'
Oct 2 18:03:43 OSAF-SC1 osafamfnd[6292]: NO Restarting a component of
'safSu=SU1,safSg=SGONE,safApp=NPMAPP' (comp restart count: 2)
Oct 2 18:03:43 OSAF-SC1 osafamfnd[6292]: NO
'safComp=COMP2SU1NPMAPP,safSu=SU1,safSg=SGONE,safApp=NPMAPP' faulted due to
'avaDown' : Recovery is 'componentRestart'
|
Oct 2 18:03:43 OSAF-SC1 osafamfnd[6292]: NO Performing failover of
'safSu=SU1,safSg=SGONE,safApp=NPMAPP' (SU failover count: 1)
Oct 2 18:03:43 OSAF-SC1 osafamfnd[6292]: NO
'safComp=COMP1SU1NPMAPP,safSu=SU1,safSg=SGONE,safApp=NPMAPP' recovery action
escalated from 'componentRestart' to 'suFailover'
|
Oct 2 18:03:47 OSAF-SC1 osafamfnd[6292]: NO
'safComp=COMP3SU1NPMAPP,safSu=SU1,safSg=SGONE,safApp=NPMAPP' recovery action
escalated from 'componentRestart' to 'nodeFailover'
Oct 2 18:03:47 OSAF-SC1 osafamfnd[6292]: NO
'safComp=COMP3SU1NPMAPP,safSu=SU1,safSg=SGONE,safApp=NPMAPP' faulted due to
'avaDown' : Recovery is 'nodeFailover'
|
Oct 2 18:03:49 OSAF-SC1 osafamfnd[6292]: NO Received reboot order, ordering
reboot now!
Oct 2 18:03:49 OSAF-SC1 osafamfnd[6292]: Rebooting OpenSAF NodeId = 131343 EE
Name = , Reason: Received reboot order, OwnNodeId = 131343, SupervisionTime = 60
Oct 2 18:03:49 OSAF-SC1 opensaf_reboot: Rebooting local node; timeout=60
I will share the logs and scripts offline as they are hude in size
---
Sent from sourceforge.net because [email protected] is
subscribed to https://sourceforge.net/p/opensaf/tickets/
To unsubscribe from further messages, a project admin can change settings at
https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a
mailing list, you can unsubscribe from the mailing list.
------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
_______________________________________________
Opensaf-tickets mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/opensaf-tickets