- **status**: review --> fixed
- **Comment**:
commit c0b9308aed27a1cc32d660194f079abda545a375
Author: thuan.tran <[email protected]>
Date: Thu Mar 5 16:24:34 2020 +0700
osaf: enhance vm frozen detection in tcp.plugin [#3164]
- Active SC will reboot if the arb time somehow has a big gap between heartbeats
in the watch takeover request. The active SC may still be OK but is rebooted
unexpectedly.
- Enhance VM frozen detection based on arb time and a local time counter.
---
** [tickets:#3164] osaf: enhance vm frozen detection in tcp.plugin**
**Status:** fixed
**Milestone:** 5.20.05
**Created:** Thu Mar 05, 2020 09:20 AM UTC by Thuan Tran
**Last Updated:** Thu Mar 05, 2020 09:58 AM UTC
**Owner:** Thuan Tran
If a VM is really frozen, the difference between two timestamp readings at the
arb will be greater than self.timeout.
The converse does not hold: a difference greater than self.timeout between two
timestamp readings at the arb does not prove that the VM is frozen.
Rebooting the active SC is correct when its VM is really frozen, since the
active SC has lost 'real' connectivity towards the arb (TCP) and the peer SC.
The problem is that tracking timestamps at the arb cannot distinguish a really
frozen VM from high load on the arbitrator node.
If only the node hosting the arb is overloaded, it is not OK to reboot
the active SC.
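
The distinction above can be sketched as follows. This is only an illustration, not the code from the commit: `Heartbeat`, `classify_gap`, and the returned labels are hypothetical names, and the sketch assumes that each heartbeat records both the timestamp read at the arb and a local time reading taken when the heartbeat was sent, and that the local reading keeps advancing across a freeze.

```python
from dataclasses import dataclass


@dataclass
class Heartbeat:
    # Hypothetical record; field names are assumptions, not taken from tcp.plugin.
    arb_time: float    # timestamp read at the arbitrator, in seconds
    local_time: float  # local time reading when the heartbeat was sent, in seconds


def classify_gap(prev, curr, timeout):
    """Classify the gap between two heartbeats.

    Returns "ok", "vm_frozen", or "arb_delayed".
    """
    arb_gap = curr.arb_time - prev.arb_time
    local_gap = curr.local_time - prev.local_time

    if arb_gap <= timeout:
        # The arb saw the heartbeats close enough together.
        return "ok"
    if local_gap > timeout:
        # The local node also saw a long pause between its own heartbeats,
        # which is consistent with the VM having been frozen.
        return "vm_frozen"
    # Heartbeats left the local node on time, so the large gap was introduced
    # on the arbitrator side (for example by high load). Do not reboot.
    return "arb_delayed"
```

With `timeout = 10`, a pair of heartbeats whose arb gap is 30 s but whose local gap is 2 s is classified as `arb_delayed`, so the active SC would not be rebooted in that case.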