On Wed, Oct 03, 2012 at 04:55:55PM +0000, Dietmar Maurer wrote: > > The difficult cases, which I think you're seeing, are partitions where > > no group has quorum, e.g. 2/2. In this case we do nothing, and the > > user has to resolve it by resetting some of the nodes > > The problem with that is that those 'difficult' cases are very likely. > For example a switch reboot results in that state if you do not have > redundant network (yes, I know that this setup is simply wrong). > > And things get worse, because it is not possible to reboot such nodes, > because rgmanager shutdown simply hangs. Is there any way to avoid that, > so that it is at least possible to reboot those nodes?
Fabio's checkquorum script will reboot nodes that loose quorum.