pciide1:1:0: bus-master DMA error: missing interrupt, status=0x20
wd3a: device timeout writing fsbn 154732672 of 154732672-154732703  (wd3
bn 154732735; cn 153504 tn 11 sn 10), retrying
wd3: soft error (corrected)
wd3(pciide1:1:0): timeout
        type: ata
        c_bcount: 16384
        c_skip: 0

It's trying to write and it recovers; which means hey I ran into a bad block and I was able to recover from that. All's good!

Enough of these means your disk is about to go tits up.

Do a "dd if=/dev/rwd3c of=/dev/null bs=1m" that should at least clear up the worst issues. Do it a few days later and see if there are new errors. If so toss the drive; if not you can trust it as far as you can throw it.

On Oct 3, 2005, at 10:27 PM, Gordon Willem Klok wrote:

I think he (Steve) is correct in his diagnosis, the drive being bad seems logical however I have been chasing some problems with ioapic and interrupts myself on a similar setup, the drive in question is attached to pciide1 which shares interrupt 17 with possibly bunch of other devices in the dmesg e.g. auich and fxp, and the soft error being corrected is missing interrupt, just a guess on my part but I have been having similar problems and have just started recently to go poking about for answers. What I can suggest is that he attempt to maybe disable some of the device he doesn't use e.g.
serial,parallel,midi or game ports and hope that the bios uses these
interrupts for something else (hasn't worked for me with my bios but its
worth a shot I guess).

GWK

Marco Peereboom wrote:

Dude you're disk is dying on you.  Replace it ASAP.
On Oct 3, 2005, at 11:57 AM, Steve Harding wrote:

I have been chasing intermittent problems with my hard disks for a while
now, and have replaced nearly everything, including drives, in an
attempt to fix them. I had convinced myself that it must be a
motherboard problem so I just swapped out to the one listed below. Disk errors show up at the end of the dmesg. This machine acts as a backup server, with data coming in from a Windows machine (via samba) and then
a mass of rsync and gtar/gzip activity.

What I was wondering is whether the problem might be something other
than hardware. Any thoughts would be appreciated.

Thanks, Steve

OpenBSD 3.7 (GENERIC.MP) #50: Sun Mar 20 00:17:19 MST 2005
[EMAIL PROTECTED]:/usr/src/sys/arch/i386/compile/ GENERIC.MP

cpu0: AMD Athlon(tm) MP 2000+ ("AuthenticAMD" 686-class) 1.67 GHz
cpu0:
FPU,V86,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PS E3 6,MMX,FXSR,SSE
real mem  = 1073258496 (1048104K)
avail mem = 972795904 (949996K)
using 4278 buffers containing 53764096 bytes (52504K) of memory
mainbus0 (root)
bios0 at mainbus0: AT/286+(f6) BIOS, date 03/05/02, BIOS32 rev. 0 @
0xfb100
apm0 at bios0: Power Management spec V1.2
apm0: AC on, battery charge unknown
pcibios0 at bios0: rev 2.1 @ 0xf0000/0xdf94
pcibios0: PCI IRQ Routing Table rev 1.0 @ 0xfdec0/208 (11 entries)
pcibios0: PCI Exclusive IRQs: 5 11
pcibios0: no compatible PCI ICU found
pcibios0: Warning, unable to fix up PCI interrupt routing
pcibios0: PCI bus #2 is the last bus
bios0: ROM list: 0xc0000/0xa800 0xcc000/0x2800 0xcf000/0x1800
mainbus0: Intel MP Specification (Version 1.4) (OEM00000 PROD00000000)
cpu0 at mainbus0: apid 0 (boot processor)
k7_powernow: couldn't map BIOS
cpu0: apic clock running at 266 MHz
cpu1 at mainbus0: apid 1 (application processor)
cpu1: AMD Athlon(tm) MP 2000+ ("AuthenticAMD" 686-class) 1.67 GHz
cpu1: FPU,V86,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV
mainbus0: bus 0 is type PCI
mainbus0: bus 1 is type PCI
mainbus0: bus 2 is type PCI
mainbus0: bus 3 is type ISA
ioapic0 at mainbus0: apid 2 pa 0xfec00000, version 11, 24 pins
pci0 at mainbus0 bus 0: configuration mode 1 (no bios)
pchb0 at pci0 dev 0 function 0 "AMD 762 PCI" rev 0x11
ppb0 at pci0 dev 1 function 0 "AMD 762 PCI-PCI" rev 0x00
pci1 at ppb0 bus 1
vga1 at pci1 dev 5 function 0 "Nvidia Vanta" rev 0x15
wsdisplay0 at vga1: console (80x25, vt100 emulation)
wsdisplay0: screen 1-5 added (80x25, vt100 emulation)
pcib0 at pci0 dev 7 function 0 "AMD 768 ISA" rev 0x05
pciide0 at pci0 dev 7 function 1 "AMD 768 IDE" rev 0x04: DMA, channel 0
configured to compatibility, channel 1 configured to compatibility
wd0 at pciide0 channel 0 drive 0: <ST340014A>
wd0: 16-sector PIO, LBA48, 38166MB, 78165360 sectors
wd0(pciide0:0:0): using PIO mode 4, Ultra-DMA mode 5
atapiscsi0 at pciide0 channel 1 drive 0
scsibus0 at atapiscsi0: 2 targets
cd0 at scsibus0 targ 0 lun 0: <SONY, DVD-ROM DDU1621, S3.3> SCSI0
5/cdrom removable
cd0(pciide0:1:0): using PIO mode 4, Ultra-DMA mode 2
"AMD 768 Power Mgmt" rev 0x03 at pci0 dev 7 function 3 not configured auich0 at pci0 dev 7 function 5 "AMD 768 AC97" rev 0x03: apic 2 int 17
(irq 11), AMD768 AC97
ac97: codec id 0x49434511 (ICEnsemble ICE1232)
ac97: codec features headphone, 18 bit DAC, 18 bit ADC, KS Waves 3D
audio0 at auich0
pciide1 at pci0 dev 9 function 0 "Promise PDC20269" rev 0x02: DMA,
channel 0 configured to native-PCI, channel 1 configured to native-PCI
pciide1: using apic 2 int 17 (irq 11) for native-PCI interrupt
wd1 at pciide1 channel 0 drive 0: <Maxtor 7Y250P0>
wd1: 16-sector PIO, LBA48, 239372MB, 490234752 sectors
wd2 at pciide1 channel 0 drive 1: <ST3250823A>
wd2: 16-sector PIO, LBA48, 238475MB, 488397168 sectors
wd1(pciide1:0:0): using PIO mode 4, Ultra-DMA mode 6
wd2(pciide1:0:1): using PIO mode 4, Ultra-DMA mode 5
wd3 at pciide1 channel 1 drive 0: <ST3300831A>
wd3: 16-sector PIO, LBA48, 286168MB, 586072368 sectors
wd3(pciide1:1:0): using PIO mode 4, Ultra-DMA mode 5
ppb1 at pci0 dev 16 function 0 "AMD 768 PCI-PCI" rev 0x05
pci2 at ppb1 bus 2
ohci0 at pci2 dev 0 function 0 "AMD 768 USB" rev 0x07: apic 2 int 19
(irq 11), version 1.0, legacy support
usb0 at ohci0: USB revision 1.0
uhub0 at usb0
uhub0: AMD OHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 4 ports with 4 removable, self powered
em0 at pci2 dev 6 function 0 "Intel PRO/1000MT (82540EM)" rev 0x02:
apic 2 int 18 (irq 5), address: 00:0e:0c:06:b6:ea
fxp0 at pci2 dev 9 function 0 "Intel 82559ER" rev 0x09, i82559S: apic 2
int 17 (irq 11), address 00:10:dc:4c:ee:e3
inphy0 at fxp0 phy 1: i82555 10/100 PHY, rev. 4
isa0 at pcib0
isadma0 at isa0
pckbc0 at isa0 port 0x60/5
pckbd0 at pckbc0 (kbd slot)
pckbc0: using irq 1 for kbd slot
wskbd0 at pckbd0 (mux 1 ignored for console): console keyboard, using
wsdisplay0
pms0 at pckbc0 (aux slot)
pckbc0: using irq 12 for aux slot
wsmouse0 at pms0 mux 0
pcppi0 at isa0 port 0x61
midi0 at pcppi0: <PC speaker>
sysbeep0 at pcppi0
lpt0 at isa0 port 0x378/4 irq 7
lm0 at isa0 port 0x290/8: W83627HF
npx0 at isa0 port 0xf0/16: using exception 16
pccom0 at isa0 port 0x3f8/8 irq 4: ns16550a, 16 byte fifo
pccom1 at isa0 port 0x2f8/8 irq 3: ns16550a, 16 byte fifo
fdc0 at isa0 port 0x3f0/6 irq 6 drq 2
fd0 at fdc0 drive 0: 1.44MB 80 cyl, 2 head, 18 sec
biomask 0 netmask 0 ttymask 0
ioapic0: pin 17 shares different IPL interrupts (40..90), degraded
performance
pctr: user-level cycle counter enabled
mtrr: Pentium Pro MTRR support
apm0: disconnected
dkcsum: wd0 matched BIOS disk 80
dkcsum: wd1 matched BIOS disk 81
dkcsum: wd2 matched BIOS disk 82
dkcsum: wd3 matched BIOS disk 83
root on wd0a
rootdev=0x0 rrootdev=0x300 rawdev=0x302
k7_powernow: couldn't map BIOS
wd3(pciide1:1:0): timeout
        type: ata
        c_bcount: 16384
        c_skip: 0
pciide1:1:0: bus-master DMA error: missing interrupt, status=0x20
wd3a: device timeout writing fsbn 154732672 of 154732672-154732703 (wd3
bn 154732735; cn 153504 tn 11 sn 10), retrying
wd3: soft error (corrected)
wd3(pciide1:1:0): timeout
        type: ata
        c_bcount: 16384
        c_skip: 0
pciide1:1:0: bus-master DMA error: missing interrupt, status=0x20
wd3a: device timeout writing fsbn 155063264 of 155063264-155063295 (wd3
bn 155063327; cn 153832 tn 10 sn 41), retrying
wd3: soft error (corrected)
wd3(pciide1:1:0): timeout
        type: ata
        c_bcount: 16384
        c_skip: 0
pciide1:1:0: bus-master DMA error: missing interrupt, status=0x20
wd3a: device timeout writing fsbn 155063360 of 155063360-155063391 (wd3
bn 155063423; cn 153832 tn 12 sn 11), retrying
wd3: soft error (corrected)
wd3(pciide1:1:0): timeout
        type: ata
        c_bcount: 16384
        c_skip: 0
pciide1:1:0: bus-master DMA error: missing interrupt, status=0x20
wd3a: device timeout writing fsbn 165973792 of 165973792-165973823 (wd3
bn 165973855; cn 164656 tn 9 sn 40), retrying
wd3: soft error (corrected)
wd3(pciide1:1:0): timeout
        type: ata
        c_bcount: 16384
        c_skip: 0
pciide1:1:0: bus-master DMA error: missing interrupt, status=0x20
wd3a: device timeout writing fsbn 167627328 of 167627328-167627359 (wd3
bn 167627391; cn 166297 tn 0 sn 15), retrying
wd3: soft error (corrected)
wd3(pciide1:1:0): timeout
        type: ata
        c_bcount: 16384
        c_skip: 0
pciide1:1:0: bus-master DMA error: missing interrupt, status=0x20
wd3a: device timeout writing fsbn 167957952 of 167957952-167957983 (wd3
bn 167958015; cn 166625 tn 0 sn 15), retrying
wd3: soft error (corrected)
wd3(pciide1:1:0): timeout
        type: ata
        c_bcount: 16384
        c_skip: 0
pciide1:1:0: bus-master DMA error: missing interrupt, status=0x20
wd3a: device timeout writing fsbn 166965888 of 166965888-166965919 (wd3
bn 166965951; cn 165640 tn 13 sn 12), retrying
wd3: soft error (corrected)
wd3(pciide1:1:0): timeout
        type: ata
        c_bcount: 16384
        c_skip: 0
pciide1:1:0: bus-master DMA error: missing interrupt, status=0x20
wd3: transfer error, downgrading to Ultra-DMA mode 4
wd3(pciide1:1:0): using PIO mode 4, Ultra-DMA mode 4
wd3a: device timeout writing fsbn 167627168 of 167627168-167627199 (wd3
bn 167627231; cn 166296 tn 13 sn 44), retrying
wd3: soft error (corrected)
wd2(pciide1:0:1): timeout
        type: ata
        c_bcount: 65536
        c_skip: 0
pciide1:0:1: bus-master DMA error: missing interrupt, status=0x60
wd2a: device timeout writing fsbn 453771584 of 453771584-453771711 (wd2
bn 453771647; cn 450170 tn 4 sn 35), retrying
wd2: soft error (corrected)
wd2(pciide1:0:1): timeout
        type: ata
        c_bcount: 65536
        c_skip: 0
pciide1:0:1: bus-master DMA error: missing interrupt, status=0x60
wd2: transfer error, downgrading to Ultra-DMA mode 4
wd1(pciide1:0:0): using PIO mode 4, Ultra-DMA mode 6
wd2(pciide1:0:1): using PIO mode 4, Ultra-DMA mode 4
wd2a: device timeout writing fsbn 453775168 of 453775168-453775295 (wd2
bn 453775231; cn 450173 tn 13 sn 28), retrying
wd2: soft error (corrected)

Contents of /etc/fstab:

/dev/wd0a / ffs rw 1 1
/dev/wd0h /home ffs rw,nodev,nosuid 1 2
/dev/wd0j /pub ffs rw,nodev,nosuid 1 2
/dev/wd0d /tmp ffs rw,nodev,nosuid 1 2
/dev/wd0g /usr ffs rw,nodev,softdep 1 2
/dev/wd0e /var ffs rw,nodev,nosuid 1 2
/dev/wd1a /pub/test ffs rw,nodev,softdep 1 2
/dev/wd1d /pub/archives ffs rw,nodev,softdep 1 2
/dev/wd2a /pub/backups ffs rw,nodev,softdep 1 2
/dev/wd3a /pub/mirror ffs rw,nodev,softdep 1 2

Reply via email to