- **status**: assigned --> review
---
** [tickets:#3143] osaf: quick reboot need stop osafamfd**
**Status:** review
**Milestone:** 5.20.01
**Created:** Thu Jan 16, 2020 02:42 AM UTC by Thuan Tran
**Last Updated:** Thu Jan 16, 2020 02:42 AM UTC
**Owner:** Thuan Tran
Under Headless enable, split-brain recovery by quick reboot sometimes not quick
enough
then amfd has chance to communicate with payloads cause unexpected reboot
order.
~~~
2020-01-15 11:19:38.238 SC-2 osafrded[151]: Quick local node rebooting, Reason:
Split-brain detected
2020-01-15 11:19:38.239 SC-2 osafsmfd[261]: WA saClmClusterNodeGet failed,
rc=SA_AIS_ERR_NOT_EXIST (12)
2020-01-15 11:19:38.239 SC-2 osafsmfd[261]: WA proc_mds_info: SMFND UP failed
2020-01-15 11:19:38.261 SC-2 opensaf_reboot: Do quick local node reboot
2020-01-15 11:19:38.425 SC-2 osafdtmd[126]: NO Lost contact with 'SC-1'
2020-01-15 11:19:38.427 SC-2 osaffmd[161]: NO AVD down on: 2010f
2020-01-15 11:19:38.427 SC-2 osaffmd[161]: NO AMFND down on: 2010f
2020-01-15 11:19:38.427 SC-2 osaffmd[161]: NO FM down on: 2010f
2020-01-15 11:19:38.427 SC-2 osaffmd[161]: NO IMMD down on: 2010f
2020-01-15 11:19:38.427 SC-2 osaffmd[161]: NO IMMND down on: 2010f
2020-01-15 11:19:38.427 SC-2 osaffmd[161]: NO Core services went down on
node_id: 2010f
2020-01-15 11:19:38.427 SC-2 osafimmd[172]: NO MDS event from svc_id 24
(change:1, dest:13)
2020-01-15 11:19:38.428 SC-2 osafimmd[172]: NO MDS event from svc_id 24
(change:6, dest:13)
2020-01-15 11:19:38.428 SC-2 osafimmd[172]: NO MDS event from svc_id 25
(change:4, dest:564113889558712)
2020-01-15 11:19:38.428 SC-2 osafamfd[229]: WA Ignore 'SC-1' amfnd down event
2020-01-15 11:19:38.428 SC-2 osafimmd[172]: WA IMMD lost contact with peer IMMD
(NCSMDS_RED_DOWN)
2020-01-15 11:19:38.428 SC-2 osafimmnd[184]: NO Recent fevs:
2020-01-15 11:19:38.428 SC-2 osafimmnd[184]: NO
<1571>[IMMND_EVT_A2ND_OI_OBJ_MODIFY ->
safAmfNode=PL-4,safAmfCluster=myAmfCluster]
2020-01-15 11:19:38.428 SC-2 osafimmnd[184]: NO
<1572>[IMMND_EVT_A2ND_OI_OBJ_MODIFY ->
safAmfNode=PL-5,safAmfCluster=myAmfCluster]
2020-01-15 11:19:38.428 SC-2 osafimmnd[184]: NO
<1573>[IMMND_EVT_D2ND_DISCARD_NODE -> node_id:2010f]
2020-01-15 11:19:38.429 SC-2 osafimmnd[184]: NO
<1569>[IMMND_EVT_A2ND_OI_OBJ_CREATE -> SaAmfCSIAssignment]
2020-01-15 11:19:38.429 SC-2 osafimmnd[184]: NO
<1570>[IMMND_EVT_A2ND_OI_OBJ_MODIFY ->
safAmfNode=SC-2,safAmfCluster=myAmfCluster]
2020-01-15 11:19:38.429 SC-2 osafimmnd[184]: NO Global discard node received
for nodeId:2010f pid:0
2020-01-15 11:19:38.429 SC-2 osafrded[151]: NO Peer down on node 0x2010f
2020-01-15 11:19:38.429 SC-2 osafimmd[172]: NO MDS event from svc_id 25
(change:4, dest:566312912814232)
2020-01-15 11:19:38.429 SC-2 osaffmd[161]: NO Current role: ACTIVE
2020-01-15 11:19:38.429 SC-2 osaffmd[161]: Rebooting OpenSAF NodeId = 131343 EE
Name = , Reason: Received Node Down for peer controller, OwnNodeId = 131599,
SupervisionTime = 60
2020-01-15 11:19:38.429 SC-2 osafamfd[229]: WA avd_msg_sanity_chk: invalid msg
id 36, msg type 8, from 2030f should be 1
2020-01-15 11:19:38.429 SC-2 osafamfd[229]: WA avd_msg_sanity_chk: reboot node
2030f to recover it
2020-01-15 11:19:38.429 SC-2 osafamfd[229]: WA avd_msg_sanity_chk: invalid msg
id 37, msg type 8, from 2030f should be 1
2020-01-15 11:19:38.429 SC-2 osafamfd[229]: WA avd_msg_sanity_chk: reboot node
2030f to recover it
~~~
---
Sent from sourceforge.net because [email protected] is
subscribed to https://sourceforge.net/p/opensaf/tickets/
To unsubscribe from further messages, a project admin can change settings at
https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a
mailing list, you can unsubscribe from the mailing list.
_______________________________________________
Opensaf-tickets mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/opensaf-tickets