>>> Eric Robinson <eric.robin...@psmnv.com> schrieb am 02.03.2021 um 19:26 in
Nachricht
<sa2pr03mb58847e37845fc6c92bc3007efa...@sa2pr03mb5884.namprd03.prod.outlook.com>

>>  -----Original Message-----
>> From: Users <users-boun...@clusterlabs.org> On Behalf Of Digimer
>> Sent: Monday, March 1, 2021 11:02 AM
>> To: Cluster Labs - All topics related to open-source clustering welcomed
>> <users@clusterlabs.org>; Ulrich Windl <ulrich.wi...@rz.uni-regensburg.de>
>> Subject: Re: [ClusterLabs] Antw: [EXT] Re: "Error: unable to fence 
> '001db02a'"
...
>> >> Cloud fencing usually requires a higher timeout (20s reported here).
>> >>
>> >> Microsoft seems to suggest the following setup:
>> >>
>> >> # pcs property set stonith‑timeout=900
>> >
>> > But doesn't that mean the other node waits 15 minutes after stonith
>> > until it performs the first post-stonith action?
>>
>> No, it means that if there is no reply by then, the fence has failed. If
the
>> fence happens sooner, and the caller is told this, recovery begins very 
> shortly
>> after.

How would the fencing be confirmed? I don't know.


>>
> 
> Interesting. Since users often report application failure within 1-3 minutes

> and may engineers begin investigating immediately, a technician could end up

> connecting to a cluster node after the stonith command was called, and could

> conceivably bring a failed no back up manually, only to have Azure finally 
> get around to shooting it in the head. I don't suppose there's a way to 
> abort/cancel a STONITH operation that is in progress?

I think you have to decide: Let the cluster handle the problem, or let the
admin handle the problem, but preferrably not both.
I also think you cannot cancel a STONITH; you can only confirm it.

Regards,
Ulrich

...


_______________________________________________
Manage your subscription:
https://lists.clusterlabs.org/mailman/listinfo/users

ClusterLabs home: https://www.clusterlabs.org/

Reply via email to