Re: [Cluster-devel] cluster4 dlm dlm_stonith ??? should it really fence by turning node off?

2012-11-05 Thread David Teigland
On Sat, Nov 03, 2012 at 03:58:28PM +0100, Jacek Konieczny wrote:
 Hello,
 
 The dlm_stonith fencing helper is really convenient when Pacemaker is in
 use. Though, it doesn't quite work as I would expect ??? when fencing 
 is needed it requests a node to be turned off instead of rebooting. And
 it doesn't handle unfencing ??? so automatic recovery is not possible
 (rebooted node could join the cluster cleanly later, provided quorum
 handling is properly configured in the cluster stack).
 
 Preferably this behaviour should be configurable. I have hacked a
 work-around by (ab)using argv[0] ??? when 'dlm_stonith' is called as
 'dlm_stonith_reboot' the node would be rebooted instead of halting 
 ??? this works for me well-enough, but I don't think this is the right
 solution.

Could you send the patch?  Do you think the patch is not right or reboot
is not right?  If the later, what do you think is wrong with reboot?

 Any ideas how to solve that properly? An argument for the helper to be
 included in the config file? Or, maybe, just change the default
 behaviour?
 
 Greets,
 Jacek



Re: [Cluster-devel] cluster4 dlm dlm_stonith ??? should it really fence by turning node off?

2012-11-05 Thread Jacek Konieczny
My workaround for the missing functionality (or wrong default):

[PATCH] dlm_stonith_{off,reboot} aliases for fence helper

Greets,
Jacek



[Cluster-devel] cluster4 dlm dlm_stonith – should it really fence by turning node off?

2012-11-03 Thread Jacek Konieczny
Hello,

The dlm_stonith fencing helper is really convenient when Pacemaker is in
use. Though, it doesn't quite work as I would expect – when fencing 
is needed it requests a node to be turned off instead of rebooting. And
it doesn't handle unfencing – so automatic recovery is not possible
(rebooted node could join the cluster cleanly later, provided quorum
handling is properly configured in the cluster stack).

Preferably this behaviour should be configurable. I have hacked a
work-around by (ab)using argv[0] – when 'dlm_stonith' is called as
'dlm_stonith_reboot' the node would be rebooted instead of halting 
– this works for me well-enough, but I don't think this is the right
solution.

Any ideas how to solve that properly? An argument for the helper to be
included in the config file? Or, maybe, just change the default
behaviour?

Greets,
Jacek