On Thu, Jul 1, 2010 at 3:09 PM, Keisuke MORI <[email protected]> wrote:
> Bad news...
>
> 2010/6/30 Andrew Beekhof <[email protected]>:
>> On Wed, Jun 30, 2010 at 12:06 PM, Keisuke MORI
>> <[email protected]> wrote:
>>> 2010/6/29 Andrew Beekhof <[email protected]>:
>>>> On Mon, Jun 28, 2010 at 2:20 PM, Keisuke MORI <[email protected]>
>>>> wrote:
>>>>> I've upgraded to pacemaker-1.0.9.1 / corosync-1.2.5 from clusterlabs on
>>>>> CentOS 5.5 using yum, but it still sometimes hangs on startup.
>>>>>
>>>>> The symptom is exactly the same as this:
>>>>> https://lists.linux-foundation.org/pipermail/openais/2010-June/014854.html
>>>>
>>>> Arrgghhh!!!
>>>>
>>>> Can you try the following patch?
>>>
>>> With the patch the problem disappeared!
>>> I've not been able to reproduce the hang by rebooting the node more
>>> than 10 times (which was enough to reproduce it previously).
>
> It didn't happen yesterday, but the same hang occurred again today.
>
> I also tried with corosync-1.2.6 but things didn't get better.
>
> Here is the stack trace and the corosync.conf when I reproduce it with
> corosync-1.2.6.
> According to the core, fileno=10 looks broken, while fileno=0,1,2,3 seem sane.

Any chance you could do some digging to figure out where fileno=10 is coming from?
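
One possible starting point for that digging, offered only as a rough sketch: if the hang can be caught while the process is still alive (rather than from the core alone), the kernel's per-process descriptor table shows what each fd number refers to. The snippet below assumes a Linux host with /proc mounted; the helper name describe_fds is made up for illustration and is not part of corosync or pacemaker, and the PID of the hung process has to be found separately.

```python
import os


def describe_fds(pid="self"):
    """Map each open descriptor number of process `pid` to its target.

    Hypothetical helper: reads the symlinks under /proc/<pid>/fd, so it
    only works on Linux and only for a process that is still running.
    """
    fd_dir = f"/proc/{pid}/fd"
    result = {}
    for name in os.listdir(fd_dir):
        try:
            target = os.readlink(os.path.join(fd_dir, name))
        except OSError:
            # The descriptor was closed between listdir() and readlink().
            continue
        result[int(name)] = target
    return result


if __name__ == "__main__":
    # Inspect this interpreter by default; pass a real PID to describe_fds()
    # (with sufficient privileges) to look at another process.
    for fd, target in sorted(describe_fds().items()):
        print(f"fd {fd:3d} -> {target}")
```

Whether fd 10 shows up as a socket, a pipe, or a regular file may narrow down which code path opened it.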
