On 08/07/13 03:48, Andreas Mock wrote:
Hi all,
I'm just wondering what the best way is to
let an admin know that the cluster (rest of
a cluster) has stonithed some other nodes?
What is the recommended way?
(The fact that the machine rebooted or is
halted is not the problem. I want to know
that stonithing was done)
Best regards
Andreas Mock
Personally, I have a little monitoring script I wrote that watches the
cluster resources, local hardware (via the IPMI BMC), UPSes and
what-not. It loops every 30 seconds and sends an email if/when anything
of note changes. A node being fenced certainly raises a flag and emails
go out.
My script is principally for cman + rgmanager, but it should be easy to
craft your own, too. On each pass I read in the current state of things,
compare it against the values from the last scan and decide whether to
send an email. Then I copy the just-read values over the last-scan
values, delete the "new" values and go back to sleep for 30 seconds.
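
For anyone wanting to roll their own, here is a rough Python sketch of
that read/compare/notify/sleep loop. It is not my actual script. The
names diff_states, monitor, read_state and notify are made up for
illustration, and how you collect the state (cluster tools, IPMI, UPS
queries) and how you send the email are left as callables you would
supply yourself.

```python
import time
from typing import Callable, Dict, List, Optional

# Hypothetical shape for one scan: item name -> status string,
# e.g. {"node2": "online", "ups1": "on mains"}.
State = Dict[str, str]


def diff_states(last: State, current: State) -> List[str]:
    """Return one readable line per item whose value changed."""
    changes = []
    for key in sorted(set(last) | set(current)):
        old, new = last.get(key), current.get(key)
        if old != new:
            changes.append(f"{key}: {old} -> {new}")
    return changes


def monitor(
    read_state: Callable[[], State],
    notify: Callable[[List[str]], None],
    interval: float = 30.0,
    max_scans: Optional[int] = None,
    sleep: Callable[[float], None] = time.sleep,
) -> None:
    """Poll read_state() and call notify() whenever something changed.

    read_state and notify are assumptions: plug in whatever gathers
    your cluster/hardware status and whatever sends your email.
    max_scans exists only so the loop can be stopped in a test.
    """
    last = read_state()  # first scan is the baseline, nothing to compare
    scans = 1
    while max_scans is None or scans < max_scans:
        sleep(interval)
        current = read_state()
        changes = diff_states(last, current)
        if changes:
            notify(changes)
        last = current  # just-read values become the last-scan values
        scans += 1
```

To try it out, pass read_state a function that returns a dict built
from whatever you can query, and notify a function that mails the list
of change lines. A fenced node would then show up as a changed value
on the next pass.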
hth
--
Digimer
Papers and Projects: https://alteeve.ca/w/
What if the cure for cancer is trapped in the mind of a person without
access to education?
_______________________________________________
Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker
Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://bugs.clusterlabs.org