On Tue, Jan 4, 2011 at 9:40 AM, Serge Dubrouski <[email protected]> wrote:
> Which OS? > > Ubuntu 10.04 Lucid. > Which version of Hearbeat? > > 3.0.3 ~# apt-cache policy heartbeat heartbeat: Installed: 1:3.0.3-1ubuntu1 Candidate: 1:3.0.3-1ubuntu1 Version table: *** 1:3.0.3-1ubuntu1 0 <heartbeat_pid> - PID of which of Heartbeat processes? It has several. > > > I used pid of master control process. i > On Tue, Jan 4, 2011 at 6:32 AM, Igor Chudov <[email protected]> wrote: > > A few weeks I reported that heartbeat died on one of the cluster > machines, > > due to SIGXCPU. > > > > Well, it happened again. Heartbeat died, now both machines had the shared > IP > > address up, what a god awful mess!!! > > > > Nopw they have split brain and the whole nine yards! > > > > I looked at /proc/<heartbeat_pid>/limits and found: > > > > Limit Soft Limit Hard Limit Units > > > > Max cpu time 43 unlimited > seconds > > > > > > So, this process somehow has a limit set for it. > > > > Does anyone have ANY clue who would set a limit for this process??? WTF? > > Does it do it for itself or what? > > _______________________________________________ > > Linux-HA mailing list > > [email protected] > > http://lists.linux-ha.org/mailman/listinfo/linux-ha > > See also: http://linux-ha.org/ReportingProblems > > > > > > -- > Serge Dubrouski. > _______________________________________________ > Linux-HA mailing list > [email protected] > http://lists.linux-ha.org/mailman/listinfo/linux-ha > See also: http://linux-ha.org/ReportingProblems > _______________________________________________ Linux-HA mailing list [email protected] http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
