>>> damiano giuliani <damianogiulian...@gmail.com> schrieb am 08.10.2021 um 
>>> 15:00
in Nachricht
<CAG=zyno0ieawqearuzh2cdmy-6kzf3dhbbubr0iiurf47bg...@mail.gmail.com>:
> Hi Guys, after months of suddens  unexpected failovers, checking every
> corners and types of logs without any luck, cuz no logs and no reasons or

If you have no logs, you should cleaerly check your configuration.

...
> So it turn out that a lil bit of swap was used and i suspect corosync
> process were swapped to disks creating lag where 1s default corosync
> timeout was not enough.

BTW: Do you use thing provisioned swap (just in case)?

> So it is, swap doesnt log anything and moving process to allocated ram to
> swap take times more that 1s default timeout (probably many many mores).

When swapping to/from SSD, it's hard to believe that it takes so long that the 
cluster nodes would be fenced.
Also code that is periodically referenced won't be swapped, specificall if you 
have plenty of RAM.

> i fix it changing the swappiness of each servers to 10 (at minimum)
> avoinding the corosync process could swap.

Do you have a proof that swap was the problem?

...

Regards,
Ulrich



_______________________________________________
Manage your subscription:
https://lists.clusterlabs.org/mailman/listinfo/users

ClusterLabs home: https://www.clusterlabs.org/

Reply via email to