---

** [tickets:#3136] amf: incorrect node failover state on standby amfd**

**Status:** assigned
**Milestone:** 5.20.01
**Created:** Mon Dec 30, 2019 10:05 AM UTC by Thuan Tran
**Last Updated:** Mon Dec 30, 2019 10:05 AM UTC
**Owner:** Thuan Tran


Reboot Standby SC and some PLs may lead to incorrect node failover state on 
standby amfd.
Because PLs joined during cold sync and standby amfd drop checkpoint data of 
node failover state.
~~~
2019-12-19T04:35:21.374+01:00 SC-1 osafamfd[21833]: NO Received node_up from 
2130f: msg_id 1
2019-12-19T04:35:21.374+01:00 SC-1 osafamfd[21833]: NO Node 'PL-19' joined the 
cluster
2019-12-19T04:35:21.396+01:00 SC-1 osafamfd[21833]: NO Received node_up from 
2150f: msg_id 1
2019-12-19T04:35:21.397+01:00 SC-1 osafamfd[21833]: NO Node 'PL-21' joined the 
cluster
2019-12-19T04:35:21.416+01:00 SC-1 osafamfd[21833]: NO Received node_up from 
20e0f: msg_id 1
2019-12-19T04:35:21.416+01:00 SC-1 osafamfd[21833]: NO Node 'PL-14' joined the 
cluster

2019-12-19T04:35:21.375+01:00 SC-2 osafamfd[21809]: WA 
avsv_validate_reo_type_in_csync: unknown type 53
2019-12-19T04:35:21.398+01:00 SC-2 osafamfd[21809]: WA 
avsv_validate_reo_type_in_csync: unknown type 53
2019-12-19T04:35:21.425+01:00 SC-2 osafamfd[21809]: WA 
avsv_validate_reo_type_in_csync: unknown type 53
2019-12-19T04:35:22.545+01:00 SC-2 osafamfd[21809]: NO Cold sync complete!


2019-12-19T04:38:20.425+01:00 SC-2 osafamfd[21809]: NO Node failover timeout
2019-12-19T04:38:20.425+01:00 SC-2 osafamfd[21809]: WA Failed node 'PL-14' has 
reappeared after network separation
2019-12-19T04:38:20.425+01:00 SC-2 osafamfd[21809]: NO Node failover timeout
2019-12-19T04:38:20.425+01:00 SC-2 osafamfd[21809]: WA Failed node 'PL-19' has 
reappeared after network separation
2019-12-19T04:38:20.425+01:00 SC-2 osafamfd[21809]: NO Node failover timeout
2019-12-19T04:38:20.425+01:00 SC-2 osafamfd[21809]: WA Failed node 'PL-21' has 
reappeared after network separation
..... these messages keep repeat .....
2019-12-19T05:08:21.425+01:00 SC-2 osafamfd[21809]: WA Failed node 'PL-14' has 
reappeared after network separation
2019-12-19T05:08:21.425+01:00 SC-2 osafamfd[21809]: NO Node failover timeout
2019-12-19T05:08:21.425+01:00 SC-2 osafamfd[21809]: WA Failed node 'PL-19' has 
reappeared after network separation
2019-12-19T05:08:21.425+01:00 SC-2 osafamfd[21809]: NO Node failover timeout
2019-12-19T05:08:21.425+01:00 SC-2 osafamfd[21809]: WA Failed node 'PL-21' has 
reappeared after network separation
~~~
When Standby amfd failover become Active, amfd will order reboot these PLs 
unexpectedly.
~~~
2019-12-19T05:17:25.626+01:00 SC-2 osafamfd[21809]: NO FAILOVER StandBy --> 
Active
2019-12-19T05:17:25.640+01:00 SC-2 osafamfd[21809]: NO Failing over OpenSAF 
components only
2019-12-19T05:17:25.642+01:00 SC-2 osafamfd[21809]: NO FAILOVER StandBy --> 
Active DONE!
...
2019-12-19T05:20:21.825+01:00 SC-2 osafamfd[21809]: WA Failed node 'PL-14' has 
reappeared after network separation
2019-12-19T05:20:21.825+01:00 SC-2 osafamfd[21809]: WA Sending node reboot order
~~~



---

Sent from sourceforge.net because [email protected] is 
subscribed to https://sourceforge.net/p/opensaf/tickets/

To unsubscribe from further messages, a project admin can change settings at 
https://sourceforge.net/p/opensaf/admin/tickets/options.  Or, if this is a 
mailing list, you can unsubscribe from the mailing list.
_______________________________________________
Opensaf-tickets mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/opensaf-tickets

Reply via email to