Hi all,

I'm in difficulty because of STONITH for the node that is in the middle
of doing kdump.
For example, when kernel panic occurs, kdump is executed in second
kernel on a node.
But it is killed by STONITH before kdump finishes, and consequently
nothing is dumped.
I know that waiting kdump to be over means increasing failover time.
But this is a serious problem for failure analysis.

So, I intend to make a STONITH plugin which checks a target node is
doing kdump or not.
It is for using with an usual sniper STONITH plugin in a group.
If the target node is doing kdump, the plugin considers that STONITH has
succeeded.

First, what do you think about this idea?
Your comments and suggestions are really appreciated.

Second, I would like to hear your opinion about the following.
I think a timeout setting shuold be necessary for STONITH plugin.

This is what I noticed while developing the plugin above,
tengine and parent-stonithd each have their timeout settings
based on "cluster-delay" and "default-action-timeout",
but child-stonithd doesn't have its own.
So, an user has no way to set definitely how long STONITH plugin may
take time.
To increase the values of "cluster-delay" and "default-action-timeout"
may permit a plugin to take longer time, but it is far from their
substance and its effect is so big.


Best Regards,
Satomi Taniguchi
_______________________________________________________
Linux-HA-Dev: [email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha-dev
Home Page: http://linux-ha.org/

Reply via email to