> We are running SUSE 7.2 in an IFL with z/VM 4.1. On this machine we are runn > ing UDBEE 7.2. Early this morning the machine hung up. I have very littl > e information to go on. Here is what I have. > > /var/log/messages > After 01:13 there were no new messages. The last message showed an REXEC s > ession starting. This REXEC is started from an OS/390 machine and has worke > d fine for weeks. > > Telnet connections would time-out. > > I do NOT know if the VM user was consuming CPU or doing I/O. > > The machine was recycled and now the REXEC-started procedure works. > > I instructed the people involved with this on how to do an INDICATE USER comm > and to gather some VM-perspective information. Are there any logs other tha > n /var/log/messages that could be helpful? Is there something that I could > turn on to further trace activity on the system?
There is an switch for syslogd that causes it to say --MARK-- from time to time if there have been no other messages. RH turned it off some years ago to stop people from asking what was wrong.... You might also take a look at software such as heartbeat (I think that's the name) used for high-availability Linux to detect when a system is sick. The book I keep recommending, Reliable Linux, has information about it. Everyone should go out and buy a copy;-) -- Cheers John Summerfield Microsoft's most solid OS: http://www.geocities.com/rcwoolley/ Note: mail delivered to me is deemed to be intended for me, for my disposition. ============================== If you don't like being told you're wrong, be right!
