- **status**: review --> fixed
- **Comment**:

commit c0b9308aed27a1cc32d660194f079abda545a375
Author: thuan.tran <[email protected]>
Date:   Thu Mar 5 16:24:34 2020 +0700

    osaf: enhance vm frozen detection in tcp.plugin [#3164]
    
    - Active SC will reboot if arb time somehow has big gap b/w heartbeats
    in watch takeover request. Active SC may still OK but be rebooted 
unexpectedly.
    - Enhance VM was frozen detection base on arb time and local time counter.





---

** [tickets:#3164] osaf: enhance vm frozen detection in tcp.plugin**

**Status:** fixed
**Milestone:** 5.20.05
**Created:** Thu Mar 05, 2020 09:20 AM UTC by Thuan Tran
**Last Updated:** Thu Mar 05, 2020 09:58 AM UTC
**Owner:** Thuan Tran


If a vm is frozen for real, we will have the diff of two timestamp reading at 
arb greater than self.timeout
but if the diff of two timestamp reading at arb greater than self.timeout, it 
is not 100% sure the vm is frozen.

It is correct to reboot the active SC if the active SC's VM is frozen for real, 
since active SC loses 'real' connectivity towards arb (TCP) and peer SC. The 
problem is that tracking timestamp at arb does not differentiate a real frozen 
vm or a high load at arbitrator node.

If it is only the node hosting arbi being overloaded, it is not ok to reboot 
the active SC.


---

Sent from sourceforge.net because [email protected] is 
subscribed to https://sourceforge.net/p/opensaf/tickets/

To unsubscribe from further messages, a project admin can change settings at 
https://sourceforge.net/p/opensaf/admin/tickets/options.  Or, if this is a 
mailing list, you can unsubscribe from the mailing list.
_______________________________________________
Opensaf-tickets mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/opensaf-tickets

Reply via email to