Steven Dake writes:

> Self-healing is not as obvious or easy as it sounds. Totem (the
> protocol) has no way to determine when the admin has replaced the faulty
> switch in the network.

why can't it keep on pinging the interface/ip address even if there is no
response? how does pingd handle this: does pinging stop if there is no
response, and does the node remain dead forever after a ping failure until
someone manually does something? i don't see any difference here between
heartbeat and openais level pinging.

> The only options I see is to periodically try the failed ring for
> liveness. The problem with this approach is it is hard to implement.

keep trying all the time after the failure, the same way it was done
before the failure.

> I think the first option is the best, but atm there isn't anyone that
> has written patches and most people are focused on the 1.0 release...

a 1.0 release that people cannot migrate to from their current
heartbeat/pacemaker setup without losing self-healing capability makes
little sense.

-- juha
_______________________________________________
Pacemaker mailing list
Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker
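
As a rough illustration of the "keep probing after failure" idea raised in the message above, here is a minimal sketch in Python. It is not Totem, openais, or pingd code. The names `RingMonitor`, `probe`, `fail_threshold`, `recover_threshold`, and `run` are all hypothetical, and the probe is an injected callable so that no real network API is assumed. The ring is marked failed after a few consecutive missed probes, probing simply continues, and the ring is marked active again after a few consecutive successes.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List


@dataclass
class RingMonitor:
    """Tracks one ring's state; all names here are hypothetical."""
    probe: Callable[[], bool]      # assumed liveness check: True if the ring answered
    fail_threshold: int = 3        # consecutive misses before marking the ring failed
    recover_threshold: int = 3     # consecutive replies before re-enabling the ring
    active: bool = True
    _fails: int = 0
    _oks: int = 0

    def tick(self) -> bool:
        """Run one probe and return whether the ring is considered active.

        The probe is issued on every tick, whether or not the ring is
        currently marked failed, so recovery needs no manual step.
        """
        if self.probe():
            self._oks += 1
            self._fails = 0
            if not self.active and self._oks >= self.recover_threshold:
                self.active = True
        else:
            self._fails += 1
            self._oks = 0
            if self.active and self._fails >= self.fail_threshold:
                self.active = False
        return self.active


def run(results: Iterable[bool]) -> List[bool]:
    """Feed a scripted sequence of probe results and collect the ring state."""
    it = iter(results)
    monitor = RingMonitor(probe=lambda: next(it))
    return [monitor.tick() for _ in range(len(results))]


if __name__ == "__main__":
    # One good probe, four misses (switch is down), then three good probes.
    print(run([True, False, False, False, False, True, True, True]))
```

Requiring several consecutive successes before re-enabling is one way to avoid flapping on a marginal link. Whether this fits inside the real protocol is exactly the implementation difficulty mentioned in the quoted text, and the sketch does not address it.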