Hi,

On Tue, Aug 18, 2009 at 10:37:23AM -0400, Marshall, Richard wrote:
> So, looking through the tons of messages sent to the /var/log/ha-log
> doesn't help, How do I determine what the cryptic messages are try to
> convey,?

Of course it does help, otherwise we wouldn't be able to solve
any issues :)

The first things to look for are messages from lrmd (the local
resource manager). That is where the cluster software meets the
reality. All resource problems and output from resource agents
are logged by lrmd. In perfect world that should be enough to
figure almost all issues. Then there are stonithd messages which
tell you if/when a node was fenced. The crmd and pengine messages
are probably the most difficult to follow and you'd need a bit of
experience for that. crmd is the master of the show and pengine
is the program which makes resource placement decisions. Logs of
crmd and pengine are normally useful only to the expert(s) :)

Thanks,

Dejan

> Richard Marshall 
> Senior Technical Specialist 
> Enterprise Technology Group 
> Arbella Insurance Group 
> Hours 6 AM - 2 PM Mon. - Fri.
> 
> [email protected] 
> 617-328-2921 
> 
> 
> -----Original Message-----
> From: [email protected]
> [mailto:[email protected]] On Behalf Of Dejan
> Muhamedagic
> Sent: Tuesday, August 18, 2009 10:09 AM
> To: General Linux-HA mailing list
> Subject: Re: [Linux-HA] SUSE 10.1 HA
> 
> Hi,
> 
> On Tue, Aug 18, 2009 at 09:57:02AM -0400, Marshall, Richard wrote:
> > Hello:
> > 
> > New to HA and would like to ask what logs, files provide information 
> > to isolate when, why a resource failed over. Is there such 
> > documentation (i.e. HA error message Ref.)?
> 
> There is no catalogue of error messages. The resource failover is
> influenced by many factors, but usually happens to either resource or
> node failing. Ultimately, the decision is made based on scores for the
> resource and the node with the highest score gets to run the resource.
> 
> Thanks,
> 
> Dejan
> 
> > Thanks
> > 
> >  
> > Richard Marshall
> > Senior Technical Specialist
> > Enterprise Technology Group
> > Arbella Insurance Group
> > Hours 6 AM - 2 PM Mon. - Fri.
> > 
> > [email protected] <mailto:[email protected]>
> > 617-328-2921
> > 
> >  
> > This email message is intended only for the addressee(s) and contains
> information that may be confidential.  
> > If you are not the intended recipient please notify the sender by
> reply email and immediately delete this message. 
> > Use, disclosure or reproduction of this email by anyone other than the
> intended recipient(s) is strictly prohibited.
> 
> 
> 
> > _______________________________________________
> > Linux-HA mailing list
> > [email protected]
> > http://lists.linux-ha.org/mailman/listinfo/linux-ha
> > See also: http://linux-ha.org/ReportingProblems
> _______________________________________________
> Linux-HA mailing list
> [email protected]
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems
> _______________________________________________
> Linux-HA mailing list
> [email protected]
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Reply via email to