Dejan Muhamedagic wrote:
On Mon, Apr 23, 2007 at 04:08:46PM +0200, Max Hofer wrote:
I think i want the same functioanlity as Christopher wants:

* when a resource on a node goes into FAILED state, reboot the machine
(currently we have no STONITH device - i know, it is insecure but i have to use
what i have)

Heartbeat version 2.0.8

Situation:
* 2 node cluster
* dummy-resource provided by heartbeat runs on management2
* DC management1

Actions:
* touch /tmp/Dummy.monitor /tmp/Dummy.stop --> in this way monitor and stop opeartion fails

Afterwards:
* dummy-resource does not run anywhere
* stonithd seems to core dump

Seems? If this is on Linux, please do

# ulimit -c unlimited (from the same terminal, before starting the heartbeat)
# echo 1 > /proc/sys/kernel/core_uses_pid

Then you'll have the core dumps. That'd be nice (that's not to say
that core dumps are nice to have, but, well, you get my point :)

* reboot of management2 failed (?? may this be because the stonithd core dumps?)
* "etc/init.d/heartbeat stop" on management2 hangs forever

Here the attached CIB, pe-warn* from management1 and ha-log of both machine.

Don't see the CIB, though I can see them in the pe-input*.
cibadmin -Q would be good too. ha.cf perhaps too.

Have you read this one: http://linux-ha.org/ReportingProblems ?

Do i something wrong with the stonith device?

This is what I see:

lrmd[1319]: 2007/04/23_15:43:46 info: Try to start STONITH resource 
<rsc_id=child-do-reboot:1> : Device=suicide
stonithd[1274]: 2007/04/23_15:43:46 ERROR: Could not list nodes for stonith RA 
suicide.

Never used the suicide agent, so can't comment further now, but
there's definitely sth wrong here. Perhaps the agent's broken?
The STONITH plugin returns NULL for hostlist which tells the folks that called it that an error occurred when it really meant to say that it only STONITHs itself. So this should return an empty hostlist (pointer to NULL) or a hostlist containing itself. I have opened a bug (1550) and will fix after I'm done with 1545.

http://old.linux-foundation.org/developer_bugzilla/show_bug.cgi?id=1550
Alan, looks like we need stonith idioms too :)

Thanks.


_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Reply via email to