Hi all,
Originally, I thought the "monitor" action would be called by heartbeat
periodically, and heartbeat would then send the
return code to other nodes in the heartbeat cluster. So the nodes can get the
status of the software service, and if the software service crashed other nodes
can take over that software service.
I wrote a test OCF Resource Agents called "myagent", it did nothing but
log the action call message such as "monitor my test agent!","stop my test
agent!","start my test agent!". But I found the monitor action was not called
by the heartbeat periodically. It just would be called when the heartbeat
started.
So I just can't understand about these. If the monitor action is not
called periodically, how can the nodes know if the
software service being monitored runs normally. If the software service
crashed suddenly, can the heartbeat nodes detected?
How can I keep the high availability of a software service with heartbeat?
Thanks in advance._______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems