- **status**: assigned --> review


---

** [tickets:#3143] osaf: quick reboot need stop osafamfd**

**Status:** review
**Milestone:** 5.20.01
**Created:** Thu Jan 16, 2020 02:42 AM UTC by Thuan Tran
**Last Updated:** Thu Jan 16, 2020 02:42 AM UTC
**Owner:** Thuan Tran


Under Headless enable, split-brain recovery by quick reboot sometimes not quick 
enough
then amfd has chance to communicate with payloads  cause unexpected reboot 
order.
~~~
2020-01-15 11:19:38.238 SC-2 osafrded[151]: Quick local node rebooting, Reason: 
Split-brain detected
2020-01-15 11:19:38.239 SC-2 osafsmfd[261]: WA saClmClusterNodeGet failed, 
rc=SA_AIS_ERR_NOT_EXIST (12)
2020-01-15 11:19:38.239 SC-2 osafsmfd[261]: WA proc_mds_info: SMFND UP failed
2020-01-15 11:19:38.261 SC-2 opensaf_reboot: Do quick local node reboot
2020-01-15 11:19:38.425 SC-2 osafdtmd[126]: NO Lost contact with 'SC-1'
2020-01-15 11:19:38.427 SC-2 osaffmd[161]: NO AVD down on: 2010f
2020-01-15 11:19:38.427 SC-2 osaffmd[161]: NO AMFND down on: 2010f
2020-01-15 11:19:38.427 SC-2 osaffmd[161]: NO FM down on: 2010f
2020-01-15 11:19:38.427 SC-2 osaffmd[161]: NO IMMD down on: 2010f
2020-01-15 11:19:38.427 SC-2 osaffmd[161]: NO IMMND down on: 2010f
2020-01-15 11:19:38.427 SC-2 osaffmd[161]: NO Core services went down on 
node_id: 2010f
2020-01-15 11:19:38.427 SC-2 osafimmd[172]: NO MDS event from svc_id 24 
(change:1, dest:13)
2020-01-15 11:19:38.428 SC-2 osafimmd[172]: NO MDS event from svc_id 24 
(change:6, dest:13)
2020-01-15 11:19:38.428 SC-2 osafimmd[172]: NO MDS event from svc_id 25 
(change:4, dest:564113889558712)
2020-01-15 11:19:38.428 SC-2 osafamfd[229]: WA Ignore 'SC-1' amfnd down event
2020-01-15 11:19:38.428 SC-2 osafimmd[172]: WA IMMD lost contact with peer IMMD 
(NCSMDS_RED_DOWN)
2020-01-15 11:19:38.428 SC-2 osafimmnd[184]: NO Recent fevs:
2020-01-15 11:19:38.428 SC-2 osafimmnd[184]: NO 
<1571>[IMMND_EVT_A2ND_OI_OBJ_MODIFY -> 
safAmfNode=PL-4,safAmfCluster=myAmfCluster]
2020-01-15 11:19:38.428 SC-2 osafimmnd[184]: NO 
<1572>[IMMND_EVT_A2ND_OI_OBJ_MODIFY -> 
safAmfNode=PL-5,safAmfCluster=myAmfCluster]
2020-01-15 11:19:38.428 SC-2 osafimmnd[184]: NO 
<1573>[IMMND_EVT_D2ND_DISCARD_NODE -> node_id:2010f]
2020-01-15 11:19:38.429 SC-2 osafimmnd[184]: NO 
<1569>[IMMND_EVT_A2ND_OI_OBJ_CREATE -> SaAmfCSIAssignment]
2020-01-15 11:19:38.429 SC-2 osafimmnd[184]: NO 
<1570>[IMMND_EVT_A2ND_OI_OBJ_MODIFY -> 
safAmfNode=SC-2,safAmfCluster=myAmfCluster]
2020-01-15 11:19:38.429 SC-2 osafimmnd[184]: NO Global discard node received 
for nodeId:2010f pid:0
2020-01-15 11:19:38.429 SC-2 osafrded[151]: NO Peer down on node 0x2010f
2020-01-15 11:19:38.429 SC-2 osafimmd[172]: NO MDS event from svc_id 25 
(change:4, dest:566312912814232)
2020-01-15 11:19:38.429 SC-2 osaffmd[161]: NO Current role: ACTIVE
2020-01-15 11:19:38.429 SC-2 osaffmd[161]: Rebooting OpenSAF NodeId = 131343 EE 
Name = , Reason: Received Node Down for peer controller, OwnNodeId = 131599, 
SupervisionTime = 60
2020-01-15 11:19:38.429 SC-2 osafamfd[229]: WA avd_msg_sanity_chk: invalid msg 
id 36, msg type 8, from 2030f should be 1
2020-01-15 11:19:38.429 SC-2 osafamfd[229]: WA avd_msg_sanity_chk: reboot node 
2030f to recover it
2020-01-15 11:19:38.429 SC-2 osafamfd[229]: WA avd_msg_sanity_chk: invalid msg 
id 37, msg type 8, from 2030f should be 1
2020-01-15 11:19:38.429 SC-2 osafamfd[229]: WA avd_msg_sanity_chk: reboot node 
2030f to recover it
~~~


---

Sent from sourceforge.net because [email protected] is 
subscribed to https://sourceforge.net/p/opensaf/tickets/

To unsubscribe from further messages, a project admin can change settings at 
https://sourceforge.net/p/opensaf/admin/tickets/options.  Or, if this is a 
mailing list, you can unsubscribe from the mailing list.
_______________________________________________
Opensaf-tickets mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/opensaf-tickets

Reply via email to