On 02/24/2011 04:33 AM, [email protected] wrote: > > Hi, > > my configuration has 2 nodes, one has a set of virtual adresses and a > webservice. The situation before crash: > node1: has all resources > node2: online, no resources > > action on node2: crm standby node2 > result on node1: corosync crashes, the child processes consume all available > cpu time > > my actions: stop all child processes on node1 (kill -9) and restart corosync > > result on node1: > node1: online, all resources > node2: offline > > result on node2: > node1: offline > node2: online, all resources > > The only way I found to workaround this problem: remove node2 from the > cluster and add it again. > There should be other solutions, maybe someone can help. Appended the > coredump and fplay. > If I keep this state (split brain), then the cluster seems to recover after > about 9 hours.... > > regards Uwe > >
The coredump you linked has a sigabort delivered to it, but I am unable to tell which line of code is asserting. Install the corosync-debuginfo package and run the bt operations again. Thanks! -steve > > > _______________________________________________ > Openais mailing list > [email protected] > https://lists.linux-foundation.org/mailman/listinfo/openais > https://lists.linux-foundation.org/mailman/listinfo/openais Hi Steve, Thanks for response, I have checked the available rhel6 packages and didn't find corosync-debuginfo..... regards Uwe
_______________________________________________ Openais mailing list [email protected] https://lists.linux-foundation.org/mailman/listinfo/openais
