I recently reconfigured the NICs and IP addresses on my 2-node test cluster (running heartbeat and Pacemaker on CentOS 5; the versions are slightly old, but they have worked fine until now on this and several other clusters). I am sure that the NIC configuration is correct and that the CIB contains the correct updated data. The ha.cf file is also correct. (I even tried switching from bcast to ucast, but that did not change the behavior.)
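For illustration only, the bcast-to-ucast switch in ha.cf looks roughly like the fragment below. The interface name and peer address are placeholders, not the actual values from my cluster.

```
# ha.cf on node 1 (illustrative values only)

# Before: heartbeats broadcast on the cluster interface
#bcast eth1

# After: heartbeats sent by unicast to the peer node's address
ucast eth1 192.0.2.2
```

The other node would carry the matching ucast line pointing back at this node's address.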
The problem is that either node can come up on its own and run all the resources. When I bring the other node online, things briefly look normal, but as soon as the stonith resource starts, the node that was already running gets fenced and the new node takes over all the resources. Then the fenced node comes back up, fences the other node, and takes over, and so on: a death match.

What I am looking for is just a hint about how to diagnose this. I have looked in the log file, but as everyone knows, those logs are incredibly voluminous, so I would like a pointer to what I should be looking for.

Thank you,
--Greg
_______________________________________________
Linux-HA mailing list
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems
