Hi,

On 08/05/2025 19:14, Vicki Chen wrote:
Hi,

I'm a software developer currently researching how Corosync utilizes the Totem 
protocol token. Corosync employs the Totem protocol to monitor the health of 
cluster nodes by circulating a token among them. If a node doesn't receive the 
token within a specified timeout period (configured in corosync.conf), it's 
considered unresponsive. I'm interested in diving deeper into this mechanism. 
Is there a specific formula or method used to determine host failure detection 
time in Pacemaker? Could you provide details? Thanks!

Are you interested in Pacemaker or Corosync? For Corosync head to https://github.com/corosync/corosync/wiki/Developers#reference-documentation specially "Totem: A Reliable Ordered Delivery Protocol for Interconnected Local-Area Networks (PDF) - Deborah A. Agarwal - 1994".

Todays Corosync differs in many areas, especially using knet instead of IP multicast, but totem itself is more or less 1:1 implementation of protocol described in Deborah A. Agarwal thesis.

Regards
  Honza


best regards,

Vicki Chen





_______________________________________________
Manage your subscription:
https://lists.clusterlabs.org/mailman/listinfo/developers

ClusterLabs home: https://www.clusterlabs.org/


_______________________________________________
Manage your subscription:
https://lists.clusterlabs.org/mailman/listinfo/developers

ClusterLabs home: https://www.clusterlabs.org/

Reply via email to