Volker Kuhlmann wrote:

I'm trying to find the cause of some kernel lockups which occur when
accessing the disks. Athlon XP 1800+, KT266/A/333 chipset, 1.5GB RAM,
SuSE 9.2. /dev/hda 200G Seagate, /dev/hdb 80G Seagate, both installed in
removable caddies.

/dev/hd[ab]:
multcount    = 16 (on)
IO_support   =  1 (32-bit)
unmaskirq    =  1 (on)
using_dma    =  1 (on)

Unpacking a tar file with 21000 files and 400MB into hdb1 crashes the
kernel. Syslog shows something like this:

...
kernel: ide: failed opcode was: unknown
kernel: hdb: dma_intr: status=0x51 { DriveReady SeekComplete Error }
kernel: hdb: dma_intr: error=0x84 { DriveStatusError BadCRC }
kernel: ide: failed opcode was: unknown
kernel: hdb: dma_intr: status=0x51 { DriveReady SeekComplete Error }
kernel: hdb: dma_intr: error=0x84 { DriveStatusError BadCRC }
kernel: ide: failed opcode was: unknown
kernel: hda: DMA disabled
kernel: ide0: reset: success

Reboot. Two concurrent dd thrashing the disk

dd bs=1k if=/dev/hdb1 of=/dev/null skip=60000000 &
dd bs=1k if=/dev/zero of=/dev/hdb1 count=1000000 &

fail to cause any trouble. What can I try next?

Any ideas much appreciated,

Volker



Volker,

Your disk's b*ggered. Switch it off for 24 hours, then get as much off as you can, print off the error messages, and then take it back under warranty!

I had the same thing happen to me, and did a load of research. What it boils down to is that is you *ever* see a disk error that hasn't been caught by the dozens of layers of error correction that it goes through, then it's dead.

Mine kicked me out of DMA mode as well, so the backups were done at almost 1MB/sec!

I've got a small 40GB in an USB 2 enclosure if it's any help.

Steve

Reply via email to