We monitor our hosts via Zabbix and take manual action when a host fails. If a host is in the state "Disconnected" or "Alert", you can declare it as degraded via the API (https://cloudstack.apache.org/api/apidocs-4.19/apis/declareHostAsDegraded.html) or the UI (icon).
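
For the API route, here is a rough sketch of how a signed declareHostAsDegraded request could be built in Python. It assumes the usual CloudStack request signing (HMAC-SHA1 over the sorted, lower-cased query string) and that the command takes the host UUID as `id`; please check both against the API page linked above. The endpoint, keys, host UUID, and the function names are placeholders of mine, not anything from our setup.

```python
import base64
import hashlib
import hmac
from urllib.parse import quote


def sign_query(query_string, secret_key):
    """Return the base64 HMAC-SHA1 signature of the lower-cased query string."""
    digest = hmac.new(
        secret_key.encode("utf-8"),
        query_string.lower().encode("utf-8"),
        hashlib.sha1,
    ).digest()
    return base64.b64encode(digest).decode("ascii")


def build_declare_degraded_url(endpoint, api_key, secret_key, host_id):
    """Build a signed URL that asks CloudStack to mark a host as degraded.

    Assumption: the command accepts the host UUID in the `id` parameter.
    """
    params = {
        "command": "declareHostAsDegraded",
        "id": host_id,
        "apiKey": api_key,
        "response": "json",
    }
    # Sort by lower-cased parameter name and URL-encode every value.
    pairs = sorted(
        ((name, quote(str(value), safe="")) for name, value in params.items()),
        key=lambda pair: pair[0].lower(),
    )
    query_string = "&".join(f"{name}={value}" for name, value in pairs)
    signature = quote(sign_query(query_string, secret_key), safe="")
    return f"{endpoint}?{query_string}&signature={signature}"


if __name__ == "__main__":
    # Placeholder values only; replace with your management server and keys.
    print(build_declare_degraded_url(
        "https://mgmt.example.com/client/api", "API-KEY", "SECRET-KEY", "HOST-UUID"))
```

The resulting URL can be sent with any HTTP client as a GET request. The host still has to be in "Disconnected" or "Alert" state for the call to be accepted.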
Daniel Salvador (gutoveronezi) also provided a very good explanation on April 10 in response to a similar question.

Regards,
Swen

-----Original Message-----
From: Dietrich, Alex <adietr...@ussignal.com.INVALID>
Sent: Friday, April 12, 2024 17:46
To: users <users@cloudstack.apache.org>
Subject: Handling KVM host failure

Hello All,

How are folks handling KVM host failure in CloudStack? For example, when a host loses power or is hard powered off, CloudStack takes nearly 15 minutes to detect that the host is offline. This creates a challenge, as VMs are considered to be running in CloudStack during that time despite being unreachable.

Is there a knob I am missing for speeding up the detection?

Thanks,
Alex