Re: [Linux-HA] Another crash of heartbeat with SIGXCPU -- now I have strace!

2011-01-14 Thread Lars Ellenberg
On Thu, Jan 13, 2011 at 01:14:58PM -0600, Igor Chudov wrote: On Thu, Jan 13, 2011 at 10:55 AM, Lars Ellenberg lars.ellenb...@linbit.com wrote: On Thu, Jan 13, 2011 at 10:17:40AM -0600, Igor Chudov wrote: Again, after about 3-4 days of running, heartbeat master process dies with

Re: [Linux-HA] Another crash of heartbeat with SIGXCPU -- now I have strace!

2011-01-14 Thread Igor Chudov
On Fri, Jan 14, 2011 at 3:50 AM, Lars Ellenberg lars.ellenb...@linbit.com wrote: On Thu, Jan 13, 2011 at 01:14:58PM -0600, Igor Chudov wrote: On Thu, Jan 13, 2011 at 10:55 AM, Lars Ellenberg lars.ellenb...@linbit.com wrote: On Thu, Jan 13, 2011 at 10:17:40AM -0600, Igor Chudov wrote:

Re: [Linux-HA] Another crash of heartbeat with SIGXCPU -- now I have strace!

2011-01-14 Thread Igor Chudov
By the way, I am now restarting heartbeat every day on both nodes, at 22:00 on one node and at 23:00 on the other. I hope that this will help. i On Fri, Jan 14, 2011 at 7:50 AM, Igor Chudov ichu...@gmail.com wrote: On Fri, Jan 14, 2011 at 3:50 AM, Lars Ellenberg

[Linux-HA] Another crash of heartbeat with SIGXCPU -- now I have strace!

2011-01-13 Thread Igor Chudov
Again, after about 3-4 days of running, the heartbeat master process dies with SIGXCPU. I was fortunate to be running strace -p on it, so I captured the trace. It looks like boring, garden-variety regular work, and then heartbeat dies with SIGXCPU. The output is a bit lengthy. Is there some way to turn OFF
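
On Linux, SIGXCPU is normally delivered when a process goes over a soft CPU-time resource limit, so one thing worth checking next to the strace output is what limit the running process actually has. The following is only a minimal sketch, assuming a Linux /proc filesystem; the helper names parse_cpu_limit and read_cpu_limit are made up for illustration and are not part of heartbeat.

```python
def parse_cpu_limit(limits_text):
    """Extract the 'Max cpu time' row from /proc/<pid>/limits text.

    Returns (soft, hard) in seconds, with None meaning 'unlimited',
    or None if the row is not present.
    """
    label = "Max cpu time"
    for line in limits_text.splitlines():
        if line.startswith(label):
            # Remaining columns are: soft limit, hard limit, units
            fields = line[len(label):].split()
            soft, hard = fields[0], fields[1]

            def to_seconds(value):
                return None if value == "unlimited" else int(value)

            return to_seconds(soft), to_seconds(hard)
    return None


def read_cpu_limit(pid):
    """Read the CPU-time limit of a running process (Linux only)."""
    with open(f"/proc/{pid}/limits") as f:
        return parse_cpu_limit(f.read())
```

Calling read_cpu_limit with the PID of the heartbeat master process would show whether a finite soft limit is in effect; a limit of (None, None) would point to some other source of the signal.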

Re: [Linux-HA] Another crash of heartbeat with SIGXCPU -- now I have strace!

2011-01-13 Thread Lars Ellenberg
On Thu, Jan 13, 2011 at 10:17:40AM -0600, Igor Chudov wrote: Again, after about 3-4 days of running, heartbeat master process dies with SIGXCPU. I was fortunate to run strace -p on it, so I captured strace. It looks like boring, garden variety regular work, and then heartbeat dies with

Re: [Linux-HA] Another crash of heartbeat with SIGXCPU -- now I have strace!

2011-01-13 Thread Igor Chudov
On Thu, Jan 13, 2011 at 10:55 AM, Lars Ellenberg lars.ellenb...@linbit.com wrote: On Thu, Jan 13, 2011 at 10:17:40AM -0600, Igor Chudov wrote: Again, after about 3-4 days of running, heartbeat master process dies with SIGXCPU. I was fortunate to run strace -p on it, so I captured strace.