- **status**: review --> fixed
- **Comment**:
commit 4de5722f578966da5828340df9f9c0a8cb856ab7
Author: thuan.tran <thuan.t...@dektech.com.au>
Date: Wed Nov 4 17:45:19 2020 +0700
fm: fix unexpected node reboot [#3230]
- Only reboot if RDE role is not ACTIVE because
there is a case that node just promote to ACTIVE.
---
** [tickets:#3230] fm: unexpected reboot node**
**Status:** fixed
**Milestone:** 5.20.11
**Created:** Wed Nov 04, 2020 09:50 AM UTC by Thuan Tran
**Last Updated:** Wed Nov 04, 2020 10:35 AM UTC
**Owner:** Thuan Tran
Unexpected reboot node by FM
~~~
2020-10-27 14:36:54.200 SC-1 osafrded[156]: NO Lost connectivity to consensus
service
2020-10-27 14:36:54.200 SC-1 osafrded[156]: Quick local node rebooting, Reason:
Lost connectivity to consensus service. Rebooting this node
2020-10-27 14:36:58.488 SC-2 osaffmd[168]: ER Unable to set active controller
in consensus service
2020-10-27 14:36:58.488 SC-2 osaffmd[168]: Quick local node rebooting, Reason:
Unable to set active controller in consensus service
2020-10-27 14:36:58.529 SC-2 opensaf_reboot: Do quick local node reboot
2020-10-27 14:36:58.871 SC-2 osafdtmd[126]: NO Established contact with 'PL-3'
2020-10-27 14:36:58.876 SC-2 osafdtmd[126]: NO Established contact with 'SC-1'
2020-10-27 14:36:58.877 SC-2 osafrded[156]: NO Peer up on node 0x2010f
2020-10-27 14:36:59.842 SC-2 osafrded[156]: NO Got peer info response from node
0x2010f with role Undefined
2020-10-27 14:36:59.844 SC-2 osafrded[156]: NO Peer down on node 0x2010f
2020-10-27 14:37:00.783 SC-2 osafrded[156]: NO Peer up on node 0x2010f
2020-10-27 14:37:00.785 SC-2 osafrded[156]: NO Got peer info response from node
0x2010f with role ACTIVE
~~~
SCs reboot due to lost connection to consensus service (arbitrator) but somehow
SC-2 slow reboot some seconds.
SC-1 reboot up and promote to Active but FM reboot node unexpectedly.
~~~
2020-10-27 14:36:59.841 SC-1 osafrded[163]: NO Peer up on node 0x2020f
2020-10-27 14:36:59.842 SC-1 osafrded[163]: NO Got peer info response from node
0x2020f with role STANDBY
2020-10-27 14:36:59.843 SC-1 osafrded[163]: NO RDE role set to QUIESCED
2020-10-27 14:36:59.844 SC-1 osafrded[163]: NO Giving up election against
0x2020f with role STANDBY. My role is now QUIESCED
2020-10-27 14:37:00.084 SC-1 /tcp.plugin: obtained lock at arbitrator
2020-10-27 14:37:00.098 SC-1 osafrded[163]: NO Active controller set to SC-1
2020-10-27 14:37:00.098 SC-1 osafrded[163]: NO Running
'/usr/local/lib/opensaf/opensaf_sc_active' with 0 argument(s)
2020-10-27 14:37:00.781 SC-1 osafrded[163]: NO Switched to ACTIVE from QUIESCED
2020-10-27 14:37:02.893 SC-1 osaffmd[175]: NO AVD down on: 2020f
2020-10-27 14:37:02.893 SC-1 osaffmd[175]: NO AMFND down on: 2020f
2020-10-27 14:37:02.893 SC-1 osaffmd[175]: NO FM down on: 2020f
2020-10-27 14:37:02.893 SC-1 osaffmd[175]: NO IMMD down on: 2020f
2020-10-27 14:37:02.893 SC-1 osaffmd[175]: NO IMMND down on: 2020f
2020-10-27 14:37:02.893 SC-1 osaffmd[175]: NO Core services went down on
node_id: 2020f
2020-10-27 14:37:02.893 SC-1 osaffmd[175]: NO Current role: ACTIVE
2020-10-27 14:37:02.895 SC-1 osaffmd[175]: Rebooting OpenSAF NodeId = 0 EE Name
= No EE Mapped, Reason: Failover occurred, but this node is not yet ready,
OwnNodeId = 131343, SupervisionTime = 60
2020-10-27 14:37:02.895 SC-1 osafrded[163]: NO Peer down on node 0x2020f
2020-10-27 14:37:02.895 SC-1 osafimmd[188]: NO MDS event from svc_id 25
(change:4, dest:568511936069789)
2020-10-27 14:37:02.895 SC-1 osafimmd[188]: NO MDS event from svc_id 25
(change:4, dest:567412424442013)
2020-10-27 14:37:02.910 SC-1 opensaf_reboot: Rebooting local node; timeout=60
~~~
---
Sent from sourceforge.net because opensaf-tickets@lists.sourceforge.net is
subscribed to https://sourceforge.net/p/opensaf/tickets/
To unsubscribe from further messages, a project admin can change settings at
https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a
mailing list, you can unsubscribe from the mailing list.
_______________________________________________
Opensaf-tickets mailing list
Opensaf-tickets@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/opensaf-tickets