Hi,

I'm responsible for looking after a couple of machines that
run some fairly large statistical analysis jobs.

One machine (Dell PowerEdge 6300, quad Pentium III, 4GB RAM, RH-7.1)
does most of the processing, often running jobs as large as 1.8GB/process.

Since upgrading to RH-7.1, the machine occasionally runs out of resources.
This seems to happen while our backup is running (we use BRU to back up to
a DDS-4 tape drive).

When this happens I can still ping the machine, but I cannot log in, and a
hard reset is needed.

Before the reset, I usually see messages like this on the console:

(scsi2:A:6): 20.000MB/s transfers (20.000MHz, offset 15)
st0: Block limits 1 - 16777215 bytes.
mm: critical shortage of bounce buffers.
(scsi1:A:1:0): Locking max tag count at 64
(scsi1:A:3:0): Locking max tag count at 64

Sometimes there are also messages about jobs being killed for lack of resources.

The SCSI controller for the disks is an Adaptec aic7890/91 Ultra2.
The tape drive uses an Adaptec aic7860 SCSI adapter.

The message "mm: critical shortage of bounce buffers" comes from
mm/highmem.c in the kernel source.

The strange thing is, at no point does swap seem to get used, no matter how
heavily loaded the machine is.
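
To put numbers on the swap observation, the memory counters could be logged
while the backup runs. The following is only a minimal sketch, not something
taken from the machine above: it assumes /proc/meminfo contains
"Name: value kB" lines including SwapTotal and SwapFree, and the function
names (parse_meminfo, swap_used_kb, snapshot) are made up for illustration.

```python
def parse_meminfo(text):
    """Return {field: first integer} for every 'Name: value [unit]' line.

    Lines whose first token after the colon is not a plain integer
    (such as a column-header line) are skipped.
    """
    fields = {}
    for line in text.splitlines():
        name, sep, rest = line.partition(":")
        parts = rest.split()
        if sep and parts and parts[0].isdigit():
            fields[name.strip()] = int(parts[0])
    return fields


def swap_used_kb(fields):
    """Swap in use, assuming SwapTotal and SwapFree are both reported in kB."""
    return fields["SwapTotal"] - fields["SwapFree"]


def snapshot(path="/proc/meminfo"):
    """Read and parse the current memory counters (path is an assumption)."""
    with open(path) as f:
        return parse_meminfo(f.read())
```

Calling snapshot() every minute or so from a small loop or cron job during
the backup window, and writing the result to a file on local disk, would show
whether swap really stays at zero right up to the hang.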

If anyone can shed some light on this it would be much appreciated.

-------------------------------------

Steve Batson
System Administrator
Victorian Institute of Animal Science
Victoria, Australia
Email: [EMAIL PROTECTED]

--------------------------------------

_______________________________________________
Seawolf-list mailing list
[EMAIL PROTECTED]
https://listman.redhat.com/mailman/listinfo/seawolf-list
