- **status**: unassigned --> accepted
- **assigned_to**: Gary Lee


---

** [tickets:#2400] AMFD: Cached node_up message causes amfnd reboot after node 
joins cluster**

**Status:** accepted
**Milestone:** 5.1.1
**Created:** Wed Mar 29, 2017 06:05 AM UTC by Minh Hon Chau
**Last Updated:** Wed Mar 29, 2017 06:05 AM UTC
**Owner:** Gary Lee


With SC Absence enabled, both SCs are restarted. After every amfnd has sent 
node_up and joined the cluster, the cluster startup timer expires and amfd 
starts application assignments. If a retransmitted node_up message (cached in 
the mailbox, or simply arriving late) is processed at this point, amfd treats 
it as a late node_up and orders a reboot of the sending node.
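
To make the failure mode concrete, here is a minimal Python sketch of the decision described above. It is a toy model, not OpenSAF code: `NodeUpTracker`, `on_node_up`, and the `drop_duplicates` flag are hypothetical names, and the duplicate check is only one possible guard, not necessarily the fix chosen for this ticket.

```python
from dataclasses import dataclass, field


@dataclass
class NodeUpTracker:
    """Toy model of node_up handling (hypothetical, not the amfd implementation)."""

    drop_duplicates: bool = True   # assumption: guard against replayed node_up
    startup_done: bool = False     # set once the cluster startup timer has expired
    joined: dict = field(default_factory=dict)  # node_id -> msg_id of accepted node_up

    def on_node_up(self, node_id: str, msg_id: int) -> str:
        # A node_up repeating the msg_id already accepted from this node
        # is treated as a retransmission and dropped.
        if self.drop_duplicates and self.joined.get(node_id) == msg_id:
            return "ignore_duplicate"
        # Any other node_up after the startup timer is considered late.
        if self.startup_done:
            return "reboot"
        self.joined[node_id] = msg_id
        return "join"


# Reported behavior: no duplicate check, so the replay triggers a reboot.
reported = NodeUpTracker(drop_duplicates=False)
reported.on_node_up("2030f", 1)
reported.startup_done = True
print(reported.on_node_up("2030f", 1))  # reboot

# With the assumed guard, the same replay is ignored.
guarded = NodeUpTracker()
guarded.on_node_up("2030f", 1)
guarded.startup_done = True
print(guarded.on_node_up("2030f", 1))  # ignore_duplicate
```

The node id `2030f` and msg_id `1` are taken from the amfd log in this ticket. A node that has never joined still gets the "reboot" result after startup in this sketch, so the guard only affects replays.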

Mar 20 15:04:46 SC-2 osafamfd[9576]: NO Receive message with event type:12, 
msg_type:31, from node:2040f, msg_id:0
Mar 20 15:04:46 SC-2 osafamfd[9576]: NO Receive message with event type:12, 
msg_type:31, from node:2030f, msg_id:0
Mar 20 15:04:46 SC-2 osafamfd[9576]: NO Receive message with event type:13, 
msg_type:32, from node:2040f, msg_id:0
Mar 20 15:04:46 SC-2 osafamfd[9576]: NO Receive message with event type:13, 
msg_type:32, from node:2030f, msg_id:0
Mar 20 15:04:46 SC-2 osafamfd[9576]: NO Received node_up_msg from all nodes
Mar 20 15:04:46 SC-2 osafamfd[9576]: NO Received node_up from 2030f: msg_id 1

Mar 20 15:04:46 SC-2 osafamfd[9576]: NO Enter restore headless cached RTAs from 
IMM
Mar 20 15:04:46 SC-2 osafamfd[9576]: NO Leave reading headless cached RTAs from 
IMM: SUCCESS
Mar 20 15:04:46 SC-2 osafamfd[9576]: NO Node 'SC-2' joined the cluster

Mar 20 15:04:49 SC-2 osafamfd[9576]: NO Received node_up from 2030f: msg_id 1
Mar 20 15:04:49 SC-2 osafamfd[9576]: NO Node 'PL-3' joined the cluster
Mar 20 15:04:49 SC-2 osafamfd[9576]: NO Received node_up from 2010f: msg_id 1
Mar 20 15:04:49 SC-2 osafamfd[9576]: NO Node 'SC-1' joined the cluster

Mar 20 15:05:00 SC-2 osafamfd[9576]: NO Cluster startup is done

Mar 20 15:05:18 SC-2 osafamfd[9576]: NO Received node_up from 2030f: msg_id 1
Mar 20 15:05:18 SC-2 osafamfd[9576]: WA Sending node reboot order to 
node:safAmfNode=PL-3,safAmfCluster=myAmfCluster, due to late node_up_msg after 
cluster startup timeout
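
For anyone checking their own logs for the same pattern, the following is a rough Python sketch that flags a node_up whose node id and msg_id were already logged before "Cluster startup is done". The function name `find_replayed_node_ups` is made up for illustration, and the regular expression assumes the message wording shown in the log above, on unwrapped syslog lines.

```python
import re

# Assumption: amfd logs node_up in exactly this wording (taken from the log above).
NODE_UP = re.compile(r"Received node_up from (\w+): msg_id (\d+)")
STARTUP_DONE = "Cluster startup is done"


def find_replayed_node_ups(lines):
    """Return (node_id, msg_id) pairs logged before startup and again after it."""
    seen = set()
    startup_done = False
    replayed = []
    for line in lines:
        if STARTUP_DONE in line:
            startup_done = True
            continue
        match = NODE_UP.search(line)
        if not match:
            continue
        key = (match.group(1), int(match.group(2)))
        # Only a repeat of an already logged (node, msg_id) pair counts as a replay.
        if startup_done and key in seen:
            replayed.append(key)
        seen.add(key)
    return replayed


sample = [
    "Mar 20 15:04:46 SC-2 osafamfd[9576]: NO Received node_up from 2030f: msg_id 1",
    "Mar 20 15:04:49 SC-2 osafamfd[9576]: NO Received node_up from 2010f: msg_id 1",
    "Mar 20 15:05:00 SC-2 osafamfd[9576]: NO Cluster startup is done",
    "Mar 20 15:05:18 SC-2 osafamfd[9576]: NO Received node_up from 2030f: msg_id 1",
]
print(find_replayed_node_ups(sample))  # [('2030f', 1)]
```

A node_up that appears for the first time after startup is not reported, since that would be a genuinely late node rather than a replay.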


