Greetings: I have the typical active-standby two node setup with a resource group consisting of an application and an IP address running on the active node.
I'm looking for ways of measuring the average time it takes for the system to failover from the active node to the standby node. I have been asked to have the system fail over from node A to node B and then after node B runs for a while the system would fail over from node B to node A. This flip-flop scenario would be carried out for say 1000 times. I have the following questions regarding this scenario: 1. Has anyone done this sort of measurements before? 2. Can Heartbeat handle this flip-flopping of >1000 times between the nodes? 3. Are there any scripts/code within the Heartbeat package that would assist in this situation? 4. What is the correct way of measuring this time interval, between one node becomes non-operational and the other node becomes active? 5. In the log files produced by Heartbeat (ha-debug, ha-log), the time stamps have resolution in seconds. Is it possible to get a finer resolution, say milliseconds? Thanks! Peter. _______________________________________________________ Linux-HA-Dev: [email protected] http://lists.linux-ha.org/mailman/listinfo/linux-ha-dev Home Page: http://linux-ha.org/
