> We are running SUSE 7.2 in an IFL with z/VM 4.1.  On this machine we are runn
> ing UDBEE 7.2.    Early this morning the machine hung up.   I have very littl
> e information to go on.   Here is what I have.
>
> /var/log/messages
> After 01:13 there were no new messages.   The last message  showed an REXEC s
> ession starting.   This REXEC is started from an OS/390 machine and has worke
> d fine for weeks.
>
> Telnet connections would time-out.
>
> I do NOT know if the VM user was consuming CPU or doing I/O.
>
> The machine was recycled and now the REXEC-started procedure works.
>
> I instructed the people involved with this on how to do an INDICATE USER comm
> and to gather some VM-perspective information.   Are there any logs other tha
> n /var/log/messages that could be helpful?    Is there something that I could
>  turn on to further trace activity on the system?


There is an switch for syslogd that causes it to say --MARK-- from time to time
if there have been no other messages. RH turned it off some years ago to stop
people from asking what was wrong....

You might also take a look at software such as heartbeat (I think that's the
name) used for high-availability Linux to detect when a system is sick.

The book I keep recommending, Reliable Linux, has information about it. Everyone
should go out and buy a copy;-)



--
Cheers
John Summerfield

Microsoft's most solid OS: http://www.geocities.com/rcwoolley/

Note: mail delivered to me is deemed to be intended for me, for my disposition.

==============================
If you don't like being told you're wrong,
        be right!

Reply via email to