On 12/06/2013, at 1:04 AM, Stefan Schloesser <[email protected]> wrote:
> Hi, > > I have a setup with 2 nodes, drbd, mysql and apache. Rather too often for my > liking (1 per month) one node is killed (fenced) by the other. Each time I am > unable to find out what actually caused this behaviour. > I can see in the logs that suddenly one node is fenced or stonith but no > error appears as to why this happens. > Each time I can simple start the node and corosync and everything works fine > again i.e. no fault is apparent. > > I already thought about auto starting corosync, but that does seem like a > good idea. I tried trimming the communication params (totem) to no avail. > > So my question is this. What's the best way to finde the cause? Following http://blog.clusterlabs.org/blog/2013/debugging-pacemaker/ in reverse might provide some illumination. _______________________________________________ Linux-HA mailing list [email protected] http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
