On 07.05.2021 13:36, Kyle O'Donnell wrote: > Hi Everyone. > > We've setup fencing with our ilo/idrac interfaces and things generally work > well but during some of our failover scenario testing we ran into issues when > we "failed' the switches in which those ilo/idrac interfaces were connected. > The issue was that resources were migrated away from any node with an offline > fencing device. I can see how that is desirable, but in our case this is > essentially a single point of failure. How are others managing this? >
I am not sure I understand the issue. So node did not fail and remained online but pacemaker migrated resources off this node? And what exactly "offline fencing device" means? Sounds you have some constraints that do it. You need to post logs at least from DC from the point stonith resource failed as well as your actual configuration with all constraints. > In one of our sites we have "smart" APC power strips so we can setup multiple > fencing devices, but in another site we do not. I tried increasing the > timeout= value on the fencing devices but that did not seem to work. > > Thanks, > Kyle > > > _______________________________________________ > Manage your subscription: > https://lists.clusterlabs.org/mailman/listinfo/users > > ClusterLabs home: https://www.clusterlabs.org/ > _______________________________________________ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/