On 02/24/2011 04:33 AM, [email protected] wrote:
> 
> Hi,
> 
> my configuration has 2 nodes, one has a set of virtual adresses and a 
> webservice. The situation before crash:
> node1: has all resources
> node2: online, no resources
> 
> action on node2: crm standby node2
> result on node1: corosync crashes, the child processes consume all available 
> cpu time
> 
> my actions: stop all child processes on node1 (kill -9) and restart corosync
> 
> result on node1:
> node1: online, all resources
> node2: offline
> 
> result on node2:
> node1: offline
> node2: online, all resources
> 
> The only way I found to workaround this problem: remove node2 from the 
> cluster and add it again.
> There should be other solutions, maybe someone can help. Appended the 
> coredump and fplay.
> If I keep this state (split brain), then the cluster seems to recover after 
> about 9 hours....
> 
> regards Uwe
> 
> 

The coredump you linked has a sigabort delivered to it, but I am unable
to tell which line of code is asserting.

Install the corosync-debuginfo package and run the bt operations again.

Thanks!
-steve


> 
> 
> _______________________________________________
> Openais mailing list
> [email protected]
> https://lists.linux-foundation.org/mailman/listinfo/openais 
> https://lists.linux-foundation.org/mailman/listinfo/openais 

Hi Steve,

Thanks for response, I have checked the available rhel6 packages and didn't 
find corosync-debuginfo.....

regards Uwe

_______________________________________________
Openais mailing list
[email protected]
https://lists.linux-foundation.org/mailman/listinfo/openais

Reply via email to