Hi, On Mon, Apr 19, 2010 at 08:02:31PM -0700, Kathy Lo wrote: > Thanks for your reply. > > I know the version we are using is very very old but it is > impossible for us to upgrade to the latest version in a short > period. > > Is any other solution to solve the problem except upgrading the service?
We see these reports occasionally and the culprit is usually a network problem. Communication is essential to clusters and you have only one media configured for heartbeats. That is not enough. Thanks, Dejan > Kathy > > > --- 2010年4月19日 星期一,Dejan Muhamedagic <[email protected]> 寫道﹕ > > > 寄件人: Dejan Muhamedagic <[email protected]> > 主題: Re: [Linux-HA] the service slows down my server severely > 收件人: "General Linux-HA mailing list" <[email protected]> > 日期: 2010年4月19日,星期一,下午7:05 > > > Hi, > > On Thu, Apr 15, 2010 at 04:38:42PM +0800, Kathy Lo wrote: > > I installed heartbeat 2.0.3 in FC 4 running Kernel > > Please upgrade, that's really ancient. > > Thanks, > > Dejan > > > 2.6.11-1.1369_FC4smp on HP DL380 G4 Server machine for about 4 > > years. > > > > Recently, after we reboot the server machine, heartbeat service repeatly > > generates the following messages in ha-debug and ha-log files and, finally, > > slow down the server machine severely. As a result, we need to switch off > > the machine by pressing the power button instead of running 'poweroff' > > command. This suitation only occurred in active node. > > > > We have so many pairs of server machines using the same configuration. > > However, this suitation only occurred in one to two pairs of server > > machines. > > For other pairs of server machines, it runs normally even reboot so many > > times. > > > > Do anyone can tell me what happen and how to solve this problem? > > > > In a pair of server machines, it consists of 2 nodes, db01 and db02. > > > > Here is ha.cf for db01: > > > > debugfile /var/log/ha-debug > > logfile /var/log/ha-log > > logfacility local0 > > keepalive 1 > > deadtime 60 > > warntime 40 > > initdead 80 > > udpport 694 > > ucast eth2 192.168.10.22 > > auto_failback off > > node db01 > > node db02 > > #respawn hacluster /usr/lib/heartbeat/ipfail > > #apiauth ipfail gid=hacluster uid=hacluster > > > > Here is ha.cf for db02: > > > > debugfile /var/log/ha-debug > > logfile /var/log/ha-log > > logfacility local0 > > keepalive 1 > > deadtime 60 > > warntime 40 > > initdead 80 > > udpport 694 > > ucast eth2 192.168.10.12 > > auto_failback off > > node db01 > > node db02 > > #respawn hacluster /usr/lib/heartbeat/ipfail > > #apiauth ipfail gid=hacluster uid=hacluster > > > > I attached part of the content of ha-debug file for your reference. > > > > > > > > > > > > _______________________________________________ > > Linux-HA mailing list > > [email protected] > > http://lists.linux-ha.org/mailman/listinfo/linux-ha > > See also: http://linux-ha.org/ReportingProblems > > _______________________________________________ > Linux-HA mailing list > [email protected] > http://lists.linux-ha.org/mailman/listinfo/linux-ha > See also: http://linux-ha.org/ReportingProblems > > > > > _______________________________________________ > Linux-HA mailing list > [email protected] > http://lists.linux-ha.org/mailman/listinfo/linux-ha > See also: http://linux-ha.org/ReportingProblems _______________________________________________ Linux-HA mailing list [email protected] http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
