Re: [Cluster-devel] cluster4 dlm dlm_stonith ??? should it really fence by turning node off?
On Sat, Nov 03, 2012 at 03:58:28PM +0100, Jacek Konieczny wrote: Hello, The dlm_stonith fencing helper is really convenient when Pacemaker is in use. Though, it doesn't quite work as I would expect ??? when fencing is needed it requests a node to be turned off instead of rebooting. And it doesn't handle unfencing ??? so automatic recovery is not possible (rebooted node could join the cluster cleanly later, provided quorum handling is properly configured in the cluster stack). Preferably this behaviour should be configurable. I have hacked a work-around by (ab)using argv[0] ??? when 'dlm_stonith' is called as 'dlm_stonith_reboot' the node would be rebooted instead of halting ??? this works for me well-enough, but I don't think this is the right solution. Could you send the patch? Do you think the patch is not right or reboot is not right? If the later, what do you think is wrong with reboot? Any ideas how to solve that properly? An argument for the helper to be included in the config file? Or, maybe, just change the default behaviour? Greets, Jacek
Re: [Cluster-devel] cluster4 dlm dlm_stonith ??? should it really fence by turning node off?
My workaround for the missing functionality (or wrong default): [PATCH] dlm_stonith_{off,reboot} aliases for fence helper Greets, Jacek
[Cluster-devel] cluster4 dlm dlm_stonith – should it really fence by turning node off?
Hello, The dlm_stonith fencing helper is really convenient when Pacemaker is in use. Though, it doesn't quite work as I would expect – when fencing is needed it requests a node to be turned off instead of rebooting. And it doesn't handle unfencing – so automatic recovery is not possible (rebooted node could join the cluster cleanly later, provided quorum handling is properly configured in the cluster stack). Preferably this behaviour should be configurable. I have hacked a work-around by (ab)using argv[0] – when 'dlm_stonith' is called as 'dlm_stonith_reboot' the node would be rebooted instead of halting – this works for me well-enough, but I don't think this is the right solution. Any ideas how to solve that properly? An argument for the helper to be included in the config file? Or, maybe, just change the default behaviour? Greets, Jacek