On 2011-07-04T15:16:38, Ulrich Windl <[email protected]> wrote:
> You are correct: When I had STONITH enabled both nodes were periodically
> rebooting. That was not fun. I'm trying to find out what's going on. Not as
> easy as I'd wish...
>
> I feel CRM is in "insulted mode": It does very little with failed resources.
> Do I really have to reboot the node to enable resource management?
It tried to stop the resource, and it couldn't; so the resource is in an
undefined state, can't be started elsewhere, and the clean-up operation
(stop) has already failed. Thus, by default, it will fence the node to
recover.
If you want to disable this for testing, you can set the meta
on-fail="block" attribute. (That'd make a poor default since it'd block
automatic recovery.)
Then, once you've manually cleaned up the error, you can use "crm
resource cleanup" to tell the cluster to try again.
Regards,
Lars
--
Architect Storage/HA, OPS Engineering, Novell, Inc.
SUSE LINUX Products GmbH, GF: Jeff Hawn, Jennifer Guild, Felix Imendörffer, HRB
21284 (AG Nürnberg)
"Experience is the name everyone gives to their mistakes." -- Oscar Wilde
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems