Hello,
I hit a kernel oops on a disk with bad blocks while backing
up another machine over NFS onto this one (the machine that
oopsed, "atlanta", is the NFS server).
The raw oops from the kernel log is:
Nov 13 03:48:37 atlanta kernel: hda: dma_intr: status=0x51 {
DriveReady SeekComplete Error }
Nov 13 03:48:37 atlanta kernel: hda: dma_intr: error=0x40 {
UncorrectableError }, LBAsect=4292633, sector=935048
Nov 13 03:48:37 atlanta kernel: end_request: I/O error, dev
03:03 (hda), sector 935048
Nov 13 03:48:37 atlanta kernel: vs-13070:
reiserfs_read_inode2: i/o failure occurred trying to find
stat data of [530838 530840 0x0 SD]
Nov 13 03:48:43 atlanta kernel: hda: dma_intr: status=0x51 {
DriveReady SeekComplete Error }
Nov 13 03:48:43 atlanta kernel: hda: dma_intr: error=0x40 {
UncorrectableError }, LBAsect=4292633, sector=935048
Nov 13 03:48:43 atlanta kernel: end_request: I/O error, dev
03:03 (hda), sector 935048
Nov 13 03:48:43 atlanta kernel: vs-13070:
reiserfs_read_inode2: i/o failure occurred trying to find
stat data of [530841 530842 0x0 SD]
Nov 13 03:50:23 atlanta kernel: hda: dma_intr: status=0x51 {
DriveReady SeekComplete Error }
Nov 13 03:50:23 atlanta kernel: hda: dma_intr: error=0x40 {
UncorrectableError }, LBAsect=4299977, sector=942392
Nov 13 03:50:23 atlanta kernel: end_request: I/O error, dev
03:03 (hda), sector 942392
Nov 13 03:50:23 atlanta kernel: vs-13070:
reiserfs_read_inode2: i/o failure occurred trying to find
stat data of [533903 533904 0x0 SD]
Nov 13 03:50:31 atlanta kernel: hda: dma_intr: status=0x51 {
DriveReady SeekComplete Error }
Nov 13 03:50:31 atlanta kernel: hda: dma_intr: error=0x40 {
UncorrectableError }, LBAsect=4299977, sector=942392
Nov 13 03:50:31 atlanta kernel: end_request: I/O error, dev
03:03 (hda), sector 942392
Nov 13 03:50:31 atlanta kernel: vs-13070:
reiserfs_read_inode2: i/o failure occurred trying to find
stat data of [533905 533906 0x0 SD]
Nov 13 03:50:37 atlanta kernel: hda: dma_intr: status=0x51 {
DriveReady SeekComplete Error }
Nov 13 03:50:37 atlanta kernel: hda: dma_intr: error=0x40 {
UncorrectableError }, LBAsect=4299977, sector=942392
Nov 13 03:50:37 atlanta kernel: end_request: I/O error, dev
03:03 (hda), sector 942392
Nov 13 03:50:37 atlanta kernel: vs-13070:
reiserfs_read_inode2: i/o failure occurred trying to find
stat data of [533905 533907 0x0 SD]
Nov 13 03:50:42 atlanta kernel: hda: dma_intr: status=0x51 {
DriveReady SeekComplete Error }
Nov 13 03:50:42 atlanta kernel: hda: dma_intr: error=0x40 {
UncorrectableError }, LBAsect=4299977, sector=942392
Nov 13 03:50:42 atlanta kernel: end_request: I/O error, dev
03:03 (hda), sector 942392
Nov 13 03:50:42 atlanta kernel: vs-13070:
reiserfs_read_inode2: i/o failure occurred trying to find
stat data of [533908 533909 0x0 SD]
Nov 13 03:50:47 atlanta kernel: hda: dma_intr: status=0x51 {
DriveReady SeekComplete Error }
Nov 13 03:50:47 atlanta kernel: hda: dma_intr: error=0x40 {
UncorrectableError }, LBAsect=4299977, sector=942392
Nov 13 03:50:47 atlanta kernel: end_request: I/O error, dev
03:03 (hda), sector 942392
Nov 13 03:50:47 atlanta kernel: vs-13070:
reiserfs_read_inode2: i/o failure occurred trying to find
stat data of [533910 533911 0x0 SD]
Nov 13 03:50:53 atlanta kernel: hda: dma_intr: status=0x51 {
DriveReady SeekComplete Error }
Nov 13 03:50:53 atlanta kernel: hda: dma_intr: error=0x40 {
UncorrectableError }, LBAsect=4299977, sector=942392
Nov 13 03:50:53 atlanta kernel: end_request: I/O error, dev
03:03 (hda), sector 942392
Nov 13 03:50:53 atlanta kernel: vs-13070:
reiserfs_read_inode2: i/o failure occurred trying to find
stat data of [533910 533912 0x0 SD]
Nov 13 03:51:00 atlanta kernel: hda: dma_intr: status=0x51 {
DriveReady SeekComplete Error }
Nov 13 03:51:00 atlanta kernel: hda: dma_intr: error=0x40 {
UncorrectableError }, LBAsect=4299969, sector=942384
Nov 13 03:51:00 atlanta kernel: end_request: I/O error, dev
03:03 (hda), sector 942384
Nov 13 03:51:06 atlanta kernel: hda: dma_intr: status=0x51 {
DriveReady SeekComplete Error }
Nov 13 03:51:06 atlanta kernel: hda: dma_intr: error=0x40 {
UncorrectableError }, LBAsect=4299970, sector=942384
Nov 13 03:51:06 atlanta kernel: end_request: I/O error, dev
03:03 (hda), sector 942384
Nov 13 03:51:11 atlanta kernel: hda: dma_intr: status=0x51 {
DriveReady SeekComplete Error }
Nov 13 03:51:11 atlanta kernel: hda: dma_intr: error=0x40 {
UncorrectableError }, LBAsect=4299970, sector=942384
Nov 13 03:51:11 atlanta kernel: end_request: I/O error, dev
03:03 (hda), sector 942384
Nov 13 03:51:20 atlanta kernel: hda: dma_intr: status=0x51 {
DriveReady SeekComplete Error }
Nov 13 03:51:20 atlanta kernel: hda: dma_intr: error=0x40 {
UncorrectableError }, LBAsect=4299975, sector=942384
Nov 13 03:51:20 atlanta kernel: end_request: I/O error, dev
03:03 (hda), sector 942384
Nov 13 03:51:25 atlanta kernel: hda: dma_intr: status=0x51 {
DriveReady SeekComplete Error }
Nov 13 03:51:31 atlanta kernel: hda: dma_intr: error=0x40 {
UncorrectableError }, LBAsect=4299975, sector=942384
Nov 13 03:51:31 atlanta kernel: end_request: I/O error, dev
03:03 (hda), sector 942384
Nov 13 03:51:31 atlanta kernel: hda: dma_intr: status=0x51 {
DriveReady SeekComplete Error }
Nov 13 03:51:31 atlanta kernel: hda: dma_intr: error=0x40 {
UncorrectableError }, LBAsect=4299975, sector=942384
Nov 13 03:51:31 atlanta kernel: end_request: I/O error, dev
03:03 (hda), sector 942384
Nov 13 03:51:36 atlanta kernel: hda: dma_intr: status=0x51 {
DriveReady SeekComplete Error }
Nov 13 03:51:36 atlanta kernel: hda: dma_intr: error=0x40 {
UncorrectableError }, LBAsect=4299975, sector=942384
Nov 13 03:51:36 atlanta kernel: end_request: I/O error, dev
03:03 (hda), sector 942384
Nov 13 03:51:41 atlanta kernel: hda: dma_intr: status=0x51 {
DriveReady SeekComplete Error }
Nov 13 03:51:41 atlanta kernel: hda: dma_intr: error=0x40 {
UncorrectableError }, LBAsect=4299975, sector=942384
Nov 13 03:51:41 atlanta kernel: end_request: I/O error, dev
03:03 (hda), sector 942384
Nov 13 03:51:41 atlanta kernel: zam-7001: io error in
reiserfs_find_entry
Nov 13 03:51:41 atlanta kernel: kernel BUG at prints.c:334!
Nov 13 03:51:41 atlanta kernel: invalid operand: 0000
Nov 13 03:51:41 atlanta kernel: CPU: 0
Nov 13 03:51:42 atlanta kernel: EIP: 0010:[<c0171cbb>]
Not tainted
Nov 13 03:51:42 atlanta kernel: EFLAGS: 00010286
Nov 13 03:51:42 atlanta kernel: eax: 0000001c ebx:
c0253580 ecx: c029eec0 edx: 00003008
Nov 13 03:51:42 atlanta kernel: esi: c1843e00 edi:
cf9bfe84 ebp: cf9bfe18 esp: cf9bfdd8
Nov 13 03:51:42 atlanta kernel: ds: 0018 es: 0018 ss:
0018
Nov 13 03:51:42 atlanta kernel: Process nfsd (pid: 2821,
stackpage=cf9bf000)
Nov 13 03:51:42 atlanta kernel: Stack: c0257ca6 0000014e
cf9bfe00 c02535ec c0165aa8 c1843e00 c0253580 00000000
Nov 13 03:51:42 atlanta kernel: c02535ec cf9bfe44
000824aa 0008264e 7d9bf77f 000001f4 00000000 00000003
Nov 13 03:51:42 atlanta kernel: d789c6e0 c0165bc0
cd5599c0 d789c73c 00000007 cf9bfe44 cf9bfe84 fffffff4
Nov 13 03:51:42 atlanta kernel: Call Trace: [<c0165aa8>]
[<c0165bc0>] [<c0146b03>] [<c013e74d>] [<c013e7fb>]
Nov 13 03:51:42 atlanta kernel: [<e0b352be>] [<c010867b>]
[<e0b3aa58>] [<e0b41fc0>] [<e0b325a3>] [<e0b41fc0>]
Nov 13 03:51:42 atlanta kernel: [<e0b198c5>] [<e0b419d8>]
[<e0b3238f>] [<e0b419c0>] [<c01055b4>]
Nov 13 03:51:42 atlanta kernel:
Nov 13 03:51:42 atlanta kernel: Code: 0f 0b 83 c4 08 68 e0
70 31 c0 85 f6 74 0f 0f b7 46 08 50 e8
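To see at a glance which sectors the drive keeps failing on, the
IDE error lines above can be summarized with a short script. This
is only a sketch: the function name summarize_ide_errors and the
sample string are made up for illustration, and it assumes the
"LBAsect=..., sector=..." pair always appears in the form shown
in this log.

```python
import re
from collections import Counter

# Matches the "LBAsect=<n>, sector=<n>" pair printed by the IDE driver.
# \s* tolerates a mail-wrapped line break between the two fields.
PAIR = re.compile(r"LBAsect=(\d+),\s*sector=(\d+)")


def summarize_ide_errors(log_text):
    """Count how often each (LBAsect, request sector) pair is reported."""
    return Counter((int(lba), int(sec)) for lba, sec in PAIR.findall(log_text))


# A few lines copied from the log above (hypothetical variable name).
sample = """\
Nov 13 03:48:37 atlanta kernel: hda: dma_intr: error=0x40 {
UncorrectableError }, LBAsect=4292633, sector=935048
Nov 13 03:50:23 atlanta kernel: hda: dma_intr: error=0x40 {
UncorrectableError }, LBAsect=4299977, sector=942392
Nov 13 03:51:06 atlanta kernel: hda: dma_intr: error=0x40 {
UncorrectableError }, LBAsect=4299970, sector=942384
"""

for (lba, sec), seen in sorted(summarize_ide_errors(sample).items()):
    # lba - sec is the distance between the two numbers; reading it as
    # the partition start plus an offset into the request is an assumption.
    print(f"LBAsect={lba} sector={sec} difference={lba - sec} seen={seen}")
```

Fed with the complete log, this should show whether only a handful
of distinct sectors are affected or whether the damage is spreading.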
The decoded oops:
atlanta:~ # dmesg|ksymoops -m /usr/src/linux/System.map
ksymoops 2.4.1 on i686 2.4.15-pre1. Options used
-V (default)
-k /proc/ksyms (default)
-l /proc/modules (default)
-o /lib/modules/2.4.15-pre1/ (default)
-m /usr/src/linux/System.map (specified)
Warning (compare_maps): mismatch on symbol nlmsvc_ops ,
lockd says e0b30a90,
/lib/modules/2.4.15-pre1/kernel/fs/lockd/lockd.o says
e0b2feec. Ignoring
/lib/modules/2.4.15-pre1/kernel/fs/lockd/lockd.o entry
Warning (compare_maps): mismatch on symbol nfs_debug ,
sunrpc says e0b23004,
/lib/modules/2.4.15-pre1/kernel/net/sunrpc/sunrpc.o says
e0b22ce4. Ignoring
/lib/modules/2.4.15-pre1/kernel/net/sunrpc/sunrpc.o entry
Warning (compare_maps): mismatch on symbol nfsd_debug ,
sunrpc says e0b23008,
/lib/modules/2.4.15-pre1/kernel/net/sunrpc/sunrpc.o says
e0b22ce8. Ignoring
/lib/modules/2.4.15-pre1/kernel/net/sunrpc/sunrpc.o entry
Warning (compare_maps): mismatch on symbol nlm_debug ,
sunrpc says e0b2300c,
/lib/modules/2.4.15-pre1/kernel/net/sunrpc/sunrpc.o says
e0b22cec. Ignoring
/lib/modules/2.4.15-pre1/kernel/net/sunrpc/sunrpc.o entry
Warning (compare_maps): mismatch on symbol rpc_debug ,
sunrpc says e0b23000,
/lib/modules/2.4.15-pre1/kernel/net/sunrpc/sunrpc.o says
e0b22ce0. Ignoring
/lib/modules/2.4.15-pre1/kernel/net/sunrpc/sunrpc.o entry
Warning (compare_maps): mismatch on symbol icmpv6_socket ,
ipv6 says e0af32e0,
/lib/modules/2.4.15-pre1/kernel/net/ipv6/ipv6.o says
e0af0460. Ignoring
/lib/modules/2.4.15-pre1/kernel/net/ipv6/ipv6.o entry
Warning (compare_maps): mismatch on symbol
icmpv6_statistics , ipv6 says e0af12e0,
/lib/modules/2.4.15-pre1/kernel/net/ipv6/ipv6.o says
e0aee460. Ignoring
/lib/modules/2.4.15-pre1/kernel/net/ipv6/ipv6.o entry
Warning (compare_maps): mismatch on symbol inet6_dev_count
, ipv6 says e0aef000,
/lib/modules/2.4.15-pre1/kernel/net/ipv6/ipv6.o says
e0aec180. Ignoring
/lib/modules/2.4.15-pre1/kernel/net/ipv6/ipv6.o entry
Warning (compare_maps): mismatch on symbol inet6_ifa_count
, ipv6 says e0aef004,
/lib/modules/2.4.15-pre1/kernel/net/ipv6/ipv6.o says
e0aec184. Ignoring
/lib/modules/2.4.15-pre1/kernel/net/ipv6/ipv6.o entry
Warning (compare_maps): mismatch on symbol inet6_protos ,
ipv6 says e0af1260,
/lib/modules/2.4.15-pre1/kernel/net/ipv6/ipv6.o says
e0aee3e0. Ignoring
/lib/modules/2.4.15-pre1/kernel/net/ipv6/ipv6.o entry
Warning (compare_maps): mismatch on symbol inetsw6 , ipv6
says e0aeefa0,
/lib/modules/2.4.15-pre1/kernel/net/ipv6/ipv6.o says
e0aec120. Ignoring
/lib/modules/2.4.15-pre1/kernel/net/ipv6/ipv6.o entry
Warning (compare_maps): mismatch on symbol ip6_ra_chain ,
ipv6 says e0af09a0,
/lib/modules/2.4.15-pre1/kernel/net/ipv6/ipv6.o says
e0aedb20. Ignoring
/lib/modules/2.4.15-pre1/kernel/net/ipv6/ipv6.o entry
Warning (compare_maps): mismatch on symbol ipv6_statistics
, ipv6 says e0aef1a0,
/lib/modules/2.4.15-pre1/kernel/net/ipv6/ipv6.o says
e0aec320. Ignoring
/lib/modules/2.4.15-pre1/kernel/net/ipv6/ipv6.o entry
Warning (compare_maps): mismatch on symbol raw_v6_htable ,
ipv6 says e0af11e0,
/lib/modules/2.4.15-pre1/kernel/net/ipv6/ipv6.o says
e0aee360. Ignoring
/lib/modules/2.4.15-pre1/kernel/net/ipv6/ipv6.o entry
Warning (compare_maps): mismatch on symbol rt6_stats , ipv6
says e0aef168,
/lib/modules/2.4.15-pre1/kernel/net/ipv6/ipv6.o says
e0aec2e8. Ignoring
/lib/modules/2.4.15-pre1/kernel/net/ipv6/ipv6.o entry
Warning (compare_maps): mismatch on symbol udp_stats_in6 ,
ipv6 says e0af09e0,
/lib/modules/2.4.15-pre1/kernel/net/ipv6/ipv6.o says
e0aedb60. Ignoring
/lib/modules/2.4.15-pre1/kernel/net/ipv6/ipv6.o entry
cpu: 0, clocks: 1335718, slice: 667859
kernel BUG at prints.c:334!
invalid operand: 0000
CPU: 0
EIP: 0010:[<c0171cbb>] Not tainted
Using defaults from ksymoops -t elf32-i386 -a i386
EFLAGS: 00010286
eax: 0000001c ebx: c0253580 ecx: c029eec0 edx:
00003008
esi: c1843e00 edi: cf9bfe84 ebp: cf9bfe18 esp:
cf9bfdd8
ds: 0018 es: 0018 ss: 0018
Process nfsd (pid: 2821, stackpage=cf9bf000)
Stack: c0257ca6 0000014e cf9bfe00 c02535ec c0165aa8 c1843e00
c0253580 00000000
c02535ec cf9bfe44 000824aa 0008264e 7d9bf77f 000001f4
00000000 00000003
d789c6e0 c0165bc0 cd5599c0 d789c73c 00000007 cf9bfe44
cf9bfe84 fffffff4
Call Trace: [<c0165aa8>] [<c0165bc0>] [<c0146b03>]
[<c013e74d>] [<c013e7fb>]
[<e0b352be>] [<c010867b>] [<e0b3aa58>] [<e0b41fc0>]
[<e0b325a3>] [<e0b41fc0>]
[<e0b198c5>] [<e0b419d8>] [<e0b3238f>] [<e0b419c0>]
[<c01055b4>]
Code: 0f 0b 83 c4 08 68 e0 70 31 c0 85 f6 74 0f 0f b7 46 08
50 e8
>>EIP; c0171cbb <reiserfs_panic+3b/68> <=====
Trace; c0165aa8 <reiserfs_find_entry+88/134>
Trace; c0165bc0 <reiserfs_lookup+6c/cc>
Trace; c0146b03 <d_alloc+1b/188>
Trace; c013e74d <lookup_hash+9d/f0>
Trace; c013e7fb <lookup_one_len+5b/6c>
Trace; e0b352be <[nfsd]nfsd_lookup+32e/468>
Trace; c010867b <do_IRQ+db/ec>
Trace; e0b3aa58 <[nfsd]nfsd3_proc_lookup+d4/e0>
Trace; e0b41fc0 <[nfsd]nfsd_procedures3+60/2c0>
Trace; e0b325a3 <[nfsd]nfsd_dispatch+d3/1a0>
Trace; e0b41fc0 <[nfsd]nfsd_procedures3+60/2c0>
Trace; e0b198c5 <[sunrpc]svc_process+28d/4d8>
Trace; e0b419d8 <[nfsd]nfsd_version3+0/10>
Trace; e0b3238f <[nfsd]nfsd+207/348>
Trace; e0b419c0 <[nfsd]nfsd_list+0/0>
Trace; c01055b4 <kernel_thread+28/38>
Code; c0171cbb <reiserfs_panic+3b/68>
00000000 <_EIP>:
Code; c0171cbb <reiserfs_panic+3b/68> <=====
0: 0f 0b ud2a <=====
Code; c0171cbd <reiserfs_panic+3d/68>
2: 83 c4 08 add $0x8,%esp
Code; c0171cc0 <reiserfs_panic+40/68>
5: 68 e0 70 31 c0 push $0xc03170e0
Code; c0171cc5 <reiserfs_panic+45/68>
a: 85 f6 test %esi,%esi
Code; c0171cc7 <reiserfs_panic+47/68>
c: 74 0f je 1d <_EIP+0x1d>
c0171cd8 <reiserfs_panic+58/68>
Code; c0171cc9 <reiserfs_panic+49/68>
e: 0f b7 46 08 movzwl 0x8(%esi),%eax
Code; c0171ccd <reiserfs_panic+4d/68>
12: 50 push %eax
Code; c0171cce <reiserfs_panic+4e/68>
13: e8 00 00 00 00 call 18 <_EIP+0x18>
c0171cd3 <reiserfs_panic+53/68>
16 warnings issued. Results may not be reliable.
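For anyone who wants the call chain without the surrounding noise,
the "Trace;" lines of the ksymoops output can be pulled apart as
sketched below. The names TRACE and parse_trace are hypothetical,
and the pattern only assumes the line layout visible in this
particular output.

```python
import re

# "Trace; <addr> <[module]symbol+offset/size>", where [module] is optional.
TRACE = re.compile(
    r"Trace;\s+([0-9a-f]{8})\s+<(?:\[(\w+)\])?(\w+)\+([0-9a-f]+)/([0-9a-f]+)>"
)


def parse_trace(text):
    """Return one dict per decoded stack frame, in the order printed."""
    frames = []
    for addr, module, symbol, offset, size in TRACE.findall(text):
        frames.append({
            "addr": int(addr, 16),
            "module": module or "kernel",  # no [module] prefix: built-in code
            "symbol": symbol,
            "offset": int(offset, 16),
            "size": int(size, 16),
        })
    return frames


# Three lines copied from the decoded oops above.
sample = """\
Trace; c0165aa8 <reiserfs_find_entry+88/134>
Trace; c0165bc0 <reiserfs_lookup+6c/cc>
Trace; e0b352be <[nfsd]nfsd_lookup+32e/468>
"""

for frame in parse_trace(sample):
    print(f"{frame['module']}: {frame['symbol']}+{frame['offset']:#x}")
```

Because ksymoops issued warnings, the symbol names themselves may
still be off; the script only restructures what was printed.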
The kernel is
Linux atlanta 2.4.15-pre1 #2 SMP Fri Nov 16 23:08:06 CET
2001 i686 unknown
and the disk is an "IBM-DTLA-307075" (is that drive model
known for failures?).
Is this bug known?
Does the filesystem now need a reiserfsck, or is it enough
to delete the files whose data or metadata touches the bad
blocks?
If a reiserfsck is needed, is there a ramdisk-based
distribution (kit) that lets me do the whole job over the
network?
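To make "touches the bad blocks" concrete, the sectors named in the
end_request lines can be turned into filesystem block numbers. This
is a rough sketch under two assumptions that the log does not
confirm: 512-byte sectors and a 4096-byte filesystem block size.
The helper name sector_to_block is made up for illustration.

```python
SECTOR_SIZE = 512   # assumption: 512-byte sectors
BLOCK_SIZE = 4096   # assumption: 4 KiB filesystem blocks


def sector_to_block(sector, block_size=BLOCK_SIZE, sector_size=SECTOR_SIZE):
    """Map a partition-relative sector number to a filesystem block number."""
    return sector * sector_size // block_size


# Sector numbers taken from the end_request lines in the log above.
reported = [935048, 942392, 942384]

blocks = sorted({sector_to_block(s) for s in reported})
print(blocks)
```

If the block size differs, only BLOCK_SIZE needs to change; the
resulting numbers are relative to the start of the partition (hda3).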
Xuân.
--
Kind regards
Xuân Baldauf
Medium.net Internet Server Software