I reconfigured the NICs and IP addresses on my 2-node test cluster
(running Heartbeat and Pacemaker on CentOS 5; the versions are slightly
old, but they have been working fine up to now on this and several
other clusters). I am sure that the NIC configuration is correct and
that the CIB contains the correct modified data. The ha.cf file is also
correct. (I even tried switching from bcast to ucast, but that did not
change the behavior.)

The problem is that either node can come up on its own and run all the
resources. When I bring the other node online, the cluster briefly
looks normal, but as soon as the stonith resource starts, the node that
was already running gets fenced and the new node takes over all the
resources. Then the fenced node comes back up, fences the other node,
and takes over, and so on: a STONITH death match.

What I am looking for is just a hint about how to diagnose this. I have
tried looking in the log file, but as everyone knows, those logs are
incredibly voluminous, so I would like to know which messages to look
for.
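
To cut the logs down before reading them, something like the following
could be used. This is only a rough sketch: the keyword list is a guess
at which words might be relevant, not a list of actual log messages, and
the log file path is whatever the cluster is configured to write to.

```python
import re
import sys

# Hypothetical keyword list: an assumption about which words are worth
# searching for, not taken from real Heartbeat or Pacemaker messages.
KEYWORDS = ("stonith", "fence", "fencing", "reboot", "poweroff")
PATTERN = re.compile("|".join(KEYWORDS), re.IGNORECASE)


def filter_lines(lines):
    """Return only the lines that mention one of the keywords (any case)."""
    return [line for line in lines if PATTERN.search(line)]


if __name__ == "__main__":
    # Usage: python filter_log.py /path/to/logfile
    with open(sys.argv[1], errors="replace") as log:
        for line in filter_lines(log):
            sys.stdout.write(line)
```

Running this against the log from each node and comparing the output by
timestamp should at least show which node requested each fencing
operation and when.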

Thank you,
--Greg


_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems
