Volker Kuhlmann wrote:
I'm trying to find the cause of some kernel lockups which occur when accessing the disks. Athlon XP 1800+, KT266/A/333 chipset, 1.5GB RAM, SuSE 9.2. /dev/hda 200G Seagate, /dev/hdb 80G Seagate, both installed in removable caddies.
/dev/hd[ab]: multcount = 16 (on) IO_support = 1 (32-bit) unmaskirq = 1 (on) using_dma = 1 (on)
Unpacking a tar file with 21000 files and 400MB into hdb1 crashes the kernel. Syslog shows something like this:
... kernel: ide: failed opcode was: unknown kernel: hdb: dma_intr: status=0x51 { DriveReady SeekComplete Error } kernel: hdb: dma_intr: error=0x84 { DriveStatusError BadCRC } kernel: ide: failed opcode was: unknown kernel: hdb: dma_intr: status=0x51 { DriveReady SeekComplete Error } kernel: hdb: dma_intr: error=0x84 { DriveStatusError BadCRC } kernel: ide: failed opcode was: unknown kernel: hda: DMA disabled kernel: ide0: reset: success
Reboot. Two concurrent dd thrashing the disk
dd bs=1k if=/dev/hdb1 of=/dev/null skip=60000000 & dd bs=1k if=/dev/zero of=/dev/hdb1 count=1000000 &
fail to cause any trouble. What can I try next?
Any ideas much appreciated,
Volker
Volker,
Your disk's b*ggered. Switch it off for 24 hours, then get as much off as you can, print off the error messages, and then take it back under warranty!
I had the same thing happen to me, and did a load of research. What it boils down to is that is you *ever* see a disk error that hasn't been caught by the dozens of layers of error correction that it goes through, then it's dead.
Mine kicked me out of DMA mode as well, so the backups were done at almost 1MB/sec!
I've got a small 40GB in an USB 2 enclosure if it's any help.
Steve
