Re: [Linux-HA] the service slows down my server severely

Dejan Muhamedagic Tue, 20 Apr 2010 03:02:50 -0700

Hi,

On Mon, Apr 19, 2010 at 08:02:31PM -0700, Kathy Lo wrote:
> Thanks for your reply.
>  
> I know the version we are using is very very old but it is
> impossible for us to upgrade to the latest version in a short
> period.
>  
> Is any other solution to solve the problem except upgrading the service?


We see these reports occasionally and the culprit is usually a
network problem. Communication is essential to clusters and you
have only one media configured for heartbeats. That is not
enough.

Thanks,

Dejan

> Kathy
> 
> 
> --- 2010年4月19日 星期一，Dejan Muhamedagic <[email protected]> 寫道﹕
> 
> 
> 寄件人: Dejan Muhamedagic <[email protected]>
> 主題: Re: [Linux-HA] the service slows down my server severely
> 收件人: "General Linux-HA mailing list" <[email protected]>
> 日期: 2010年4月19日,星期一,下午7:05
> 
> 
> Hi,
> 
> On Thu, Apr 15, 2010 at 04:38:42PM +0800, Kathy Lo wrote:
> > I installed heartbeat 2.0.3 in FC 4 running Kernel
> 
> Please upgrade, that's really ancient.
> 
> Thanks,
> 
> Dejan
> 
> > 2.6.11-1.1369_FC4smp on HP DL380 G4 Server machine for about 4
> > years.
> > 
> > Recently, after we reboot the server machine, heartbeat service repeatly 
> > generates the following messages in ha-debug and ha-log files and, finally, 
> > slow down the server machine severely. As a result, we need to switch off 
> > the machine by pressing the power button instead of running 'poweroff' 
> > command. This suitation only occurred in active node.
> >  
> > We have so many pairs of server machines using the same configuration.
> > However, this suitation only occurred in one to two pairs of server 
> > machines.
> > For other pairs of server machines, it runs normally even reboot so many 
> > times.
> >  
> > Do anyone can tell me what happen and how to solve this problem?
> >  
> > In a pair of server machines, it consists of 2 nodes, db01 and db02.
> >  
> > Here is ha.cf for db01:
> >  
> > debugfile /var/log/ha-debug
> > logfile /var/log/ha-log
> > logfacility local0
> > keepalive 1
> > deadtime 60
> > warntime 40
> > initdead 80
> > udpport 694
> > ucast eth2 192.168.10.22
> > auto_failback off
> > node db01
> > node db02
> > #respawn hacluster /usr/lib/heartbeat/ipfail
> > #apiauth ipfail gid=hacluster uid=hacluster
> >  
> > Here is ha.cf for db02:
> >  
> > debugfile /var/log/ha-debug
> > logfile /var/log/ha-log
> > logfacility local0
> > keepalive 1
> > deadtime 60
> > warntime 40
> > initdead 80
> > udpport 694
> > ucast eth2 192.168.10.12
> > auto_failback off
> > node db01
> > node db02
> > #respawn hacluster /usr/lib/heartbeat/ipfail
> > #apiauth ipfail gid=hacluster uid=hacluster
> >  
> > I attached part of the content of ha-debug file for your reference.
> > 
> > 
> > 
> >       
> 
> 
> > _______________________________________________
> > Linux-HA mailing list
> > [email protected]
> > http://lists.linux-ha.org/mailman/listinfo/linux-ha
> > See also: http://linux-ha.org/ReportingProblems
> 
> _______________________________________________
> Linux-HA mailing list
> [email protected]
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems
> 
> 
> 
>       
> _______________________________________________
> Linux-HA mailing list
> [email protected]
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Re: [Linux-HA] the service slows down my server severely

Reply via email to