Hello,

I hit a kernel oops on a machine whose disk has bad blocks.
It happened while another machine was writing a backup to it
over NFS (the machine that oopsed, "atlanta", is the NFS
server).

The oops is:

Nov 13 03:48:37 atlanta kernel: hda: dma_intr: status=0x51 {
DriveReady SeekComplete Error }
Nov 13 03:48:37 atlanta kernel: hda: dma_intr: error=0x40 {
UncorrectableError }, LBAsect=4292633, sector=935048
Nov 13 03:48:37 atlanta kernel: end_request: I/O error, dev
03:03 (hda), sector 935048
Nov 13 03:48:37 atlanta kernel: vs-13070:
reiserfs_read_inode2: i/o failure occurred trying to find
stat data of [530838 530840 0x0 SD]
Nov 13 03:48:43 atlanta kernel: hda: dma_intr: status=0x51 {
DriveReady SeekComplete Error }
Nov 13 03:48:43 atlanta kernel: hda: dma_intr: error=0x40 {
UncorrectableError }, LBAsect=4292633, sector=935048
Nov 13 03:48:43 atlanta kernel: end_request: I/O error, dev
03:03 (hda), sector 935048
Nov 13 03:48:43 atlanta kernel: vs-13070:
reiserfs_read_inode2: i/o failure occurred trying to find
stat data of [530841 530842 0x0 SD]
Nov 13 03:50:23 atlanta kernel: hda: dma_intr: status=0x51 {
DriveReady SeekComplete Error }
Nov 13 03:50:23 atlanta kernel: hda: dma_intr: error=0x40 {
UncorrectableError }, LBAsect=4299977, sector=942392
Nov 13 03:50:23 atlanta kernel: end_request: I/O error, dev
03:03 (hda), sector 942392
Nov 13 03:50:23 atlanta kernel: vs-13070:
reiserfs_read_inode2: i/o failure occurred trying to find
stat data of [533903 533904 0x0 SD]
Nov 13 03:50:31 atlanta kernel: hda: dma_intr: status=0x51 {
DriveReady SeekComplete Error }
Nov 13 03:50:31 atlanta kernel: hda: dma_intr: error=0x40 {
UncorrectableError }, LBAsect=4299977, sector=942392
Nov 13 03:50:31 atlanta kernel: end_request: I/O error, dev
03:03 (hda), sector 942392
Nov 13 03:50:31 atlanta kernel: vs-13070:
reiserfs_read_inode2: i/o failure occurred trying to find
stat data of [533905 533906 0x0 SD]
Nov 13 03:50:37 atlanta kernel: hda: dma_intr: status=0x51 {
DriveReady SeekComplete Error }
Nov 13 03:50:37 atlanta kernel: hda: dma_intr: error=0x40 {
UncorrectableError }, LBAsect=4299977, sector=942392
Nov 13 03:50:37 atlanta kernel: end_request: I/O error, dev
03:03 (hda), sector 942392
Nov 13 03:50:37 atlanta kernel: vs-13070:
reiserfs_read_inode2: i/o failure occurred trying to find
stat data of [533905 533907 0x0 SD]
Nov 13 03:50:42 atlanta kernel: hda: dma_intr: status=0x51 {
DriveReady SeekComplete Error }
Nov 13 03:50:42 atlanta kernel: hda: dma_intr: error=0x40 {
UncorrectableError }, LBAsect=4299977, sector=942392
Nov 13 03:50:42 atlanta kernel: end_request: I/O error, dev
03:03 (hda), sector 942392
Nov 13 03:50:42 atlanta kernel: vs-13070:
reiserfs_read_inode2: i/o failure occurred trying to find
stat data of [533908 533909 0x0 SD]
Nov 13 03:50:47 atlanta kernel: hda: dma_intr: status=0x51 {
DriveReady SeekComplete Error }
Nov 13 03:50:47 atlanta kernel: hda: dma_intr: error=0x40 {
UncorrectableError }, LBAsect=4299977, sector=942392
Nov 13 03:50:47 atlanta kernel: end_request: I/O error, dev
03:03 (hda), sector 942392
Nov 13 03:50:47 atlanta kernel: vs-13070:
reiserfs_read_inode2: i/o failure occurred trying to find
stat data of [533910 533911 0x0 SD]
Nov 13 03:50:53 atlanta kernel: hda: dma_intr: status=0x51 {
DriveReady SeekComplete Error }
Nov 13 03:50:53 atlanta kernel: hda: dma_intr: error=0x40 {
UncorrectableError }, LBAsect=4299977, sector=942392
Nov 13 03:50:53 atlanta kernel: end_request: I/O error, dev
03:03 (hda), sector 942392
Nov 13 03:50:53 atlanta kernel: vs-13070:
reiserfs_read_inode2: i/o failure occurred trying to find
stat data of [533910 533912 0x0 SD]
Nov 13 03:51:00 atlanta kernel: hda: dma_intr: status=0x51 {
DriveReady SeekComplete Error }
Nov 13 03:51:00 atlanta kernel: hda: dma_intr: error=0x40 {
UncorrectableError }, LBAsect=4299969, sector=942384
Nov 13 03:51:00 atlanta kernel: end_request: I/O error, dev
03:03 (hda), sector 942384
Nov 13 03:51:06 atlanta kernel: hda: dma_intr: status=0x51 {
DriveReady SeekComplete Error }
Nov 13 03:51:06 atlanta kernel: hda: dma_intr: error=0x40 {
UncorrectableError }, LBAsect=4299970, sector=942384
Nov 13 03:51:06 atlanta kernel: end_request: I/O error, dev
03:03 (hda), sector 942384
Nov 13 03:51:11 atlanta kernel: hda: dma_intr: status=0x51 {
DriveReady SeekComplete Error }
Nov 13 03:51:11 atlanta kernel: hda: dma_intr: error=0x40 {
UncorrectableError }, LBAsect=4299970, sector=942384
Nov 13 03:51:11 atlanta kernel: end_request: I/O error, dev
03:03 (hda), sector 942384
Nov 13 03:51:20 atlanta kernel: hda: dma_intr: status=0x51 {
DriveReady SeekComplete Error }
Nov 13 03:51:20 atlanta kernel: hda: dma_intr: error=0x40 {
UncorrectableError }, LBAsect=4299975, sector=942384
Nov 13 03:51:20 atlanta kernel: end_request: I/O error, dev
03:03 (hda), sector 942384
Nov 13 03:51:25 atlanta kernel: hda: dma_intr: status=0x51 {
DriveReady SeekComplete Error }
Nov 13 03:51:31 atlanta kernel: hda: dma_intr: error=0x40 {
UncorrectableError }, LBAsect=4299975, sector=942384
Nov 13 03:51:31 atlanta kernel: end_request: I/O error, dev
03:03 (hda), sector 942384
Nov 13 03:51:31 atlanta kernel: hda: dma_intr: status=0x51 {
DriveReady SeekComplete Error }
Nov 13 03:51:31 atlanta kernel: hda: dma_intr: error=0x40 {
UncorrectableError }, LBAsect=4299975, sector=942384
Nov 13 03:51:31 atlanta kernel: end_request: I/O error, dev
03:03 (hda), sector 942384
Nov 13 03:51:36 atlanta kernel: hda: dma_intr: status=0x51 {
DriveReady SeekComplete Error }
Nov 13 03:51:36 atlanta kernel: hda: dma_intr: error=0x40 {
UncorrectableError }, LBAsect=4299975, sector=942384
Nov 13 03:51:36 atlanta kernel: end_request: I/O error, dev
03:03 (hda), sector 942384
Nov 13 03:51:41 atlanta kernel: hda: dma_intr: status=0x51 {
DriveReady SeekComplete Error }
Nov 13 03:51:41 atlanta kernel: hda: dma_intr: error=0x40 {
UncorrectableError }, LBAsect=4299975, sector=942384
Nov 13 03:51:41 atlanta kernel: end_request: I/O error, dev
03:03 (hda), sector 942384
Nov 13 03:51:41 atlanta kernel: zam-7001: io error in
reiserfs_find_entry
Nov 13 03:51:41 atlanta kernel: kernel BUG at prints.c:334!
Nov 13 03:51:41 atlanta kernel: invalid operand: 0000
Nov 13 03:51:41 atlanta kernel: CPU:    0
Nov 13 03:51:42 atlanta kernel: EIP:    0010:[<c0171cbb>]
Not tainted
Nov 13 03:51:42 atlanta kernel: EFLAGS: 00010286
Nov 13 03:51:42 atlanta kernel: eax: 0000001c   ebx:
c0253580   ecx: c029eec0   edx: 00003008
Nov 13 03:51:42 atlanta kernel: esi: c1843e00   edi:
cf9bfe84   ebp: cf9bfe18   esp: cf9bfdd8
Nov 13 03:51:42 atlanta kernel: ds: 0018   es: 0018   ss:
0018
Nov 13 03:51:42 atlanta kernel: Process nfsd (pid: 2821,
stackpage=cf9bf000)
Nov 13 03:51:42 atlanta kernel: Stack: c0257ca6 0000014e
cf9bfe00 c02535ec c0165aa8 c1843e00 c0253580 00000000
Nov 13 03:51:42 atlanta kernel:        c02535ec cf9bfe44
000824aa 0008264e 7d9bf77f 000001f4 00000000 00000003
Nov 13 03:51:42 atlanta kernel:        d789c6e0 c0165bc0
cd5599c0 d789c73c 00000007 cf9bfe44 cf9bfe84 fffffff4
Nov 13 03:51:42 atlanta kernel: Call Trace: [<c0165aa8>]
[<c0165bc0>] [<c0146b03>] [<c013e74d>] [<c013e7fb>]
Nov 13 03:51:42 atlanta kernel:    [<e0b352be>] [<c010867b>]
[<e0b3aa58>] [<e0b41fc0>] [<e0b325a3>] [<e0b41fc0>]
Nov 13 03:51:42 atlanta kernel:    [<e0b198c5>] [<e0b419d8>]
[<e0b3238f>] [<e0b419c0>] [<c01055b4>]
Nov 13 03:51:42 atlanta kernel:
Nov 13 03:51:42 atlanta kernel: Code: 0f 0b 83 c4 08 68 e0
70 31 c0 85 f6 74 0f 0f b7 46 08 50 e8

The decoded oops:

atlanta:~ # dmesg|ksymoops -m /usr/src/linux/System.map
ksymoops 2.4.1 on i686 2.4.15-pre1.  Options used
     -V (default)
     -k /proc/ksyms (default)
     -l /proc/modules (default)
     -o /lib/modules/2.4.15-pre1/ (default)
     -m /usr/src/linux/System.map (specified)

Warning (compare_maps): mismatch on symbol nlmsvc_ops  ,
lockd says e0b30a90,
/lib/modules/2.4.15-pre1/kernel/fs/lockd/lockd.o says
e0b2feec.  Ignoring
/lib/modules/2.4.15-pre1/kernel/fs/lockd/lockd.o entry
Warning (compare_maps): mismatch on symbol nfs_debug  ,
sunrpc says e0b23004,
/lib/modules/2.4.15-pre1/kernel/net/sunrpc/sunrpc.o says
e0b22ce4.  Ignoring
/lib/modules/2.4.15-pre1/kernel/net/sunrpc/sunrpc.o entry
Warning (compare_maps): mismatch on symbol nfsd_debug  ,
sunrpc says e0b23008,
/lib/modules/2.4.15-pre1/kernel/net/sunrpc/sunrpc.o says
e0b22ce8.  Ignoring
/lib/modules/2.4.15-pre1/kernel/net/sunrpc/sunrpc.o entry
Warning (compare_maps): mismatch on symbol nlm_debug  ,
sunrpc says e0b2300c,
/lib/modules/2.4.15-pre1/kernel/net/sunrpc/sunrpc.o says
e0b22cec.  Ignoring
/lib/modules/2.4.15-pre1/kernel/net/sunrpc/sunrpc.o entry
Warning (compare_maps): mismatch on symbol rpc_debug  ,
sunrpc says e0b23000,
/lib/modules/2.4.15-pre1/kernel/net/sunrpc/sunrpc.o says
e0b22ce0.  Ignoring
/lib/modules/2.4.15-pre1/kernel/net/sunrpc/sunrpc.o entry
Warning (compare_maps): mismatch on symbol icmpv6_socket  ,
ipv6 says e0af32e0,
/lib/modules/2.4.15-pre1/kernel/net/ipv6/ipv6.o says
e0af0460.  Ignoring
/lib/modules/2.4.15-pre1/kernel/net/ipv6/ipv6.o entry
Warning (compare_maps): mismatch on symbol
icmpv6_statistics  , ipv6 says e0af12e0,
/lib/modules/2.4.15-pre1/kernel/net/ipv6/ipv6.o says
e0aee460.  Ignoring
/lib/modules/2.4.15-pre1/kernel/net/ipv6/ipv6.o entry
Warning (compare_maps): mismatch on symbol inet6_dev_count
, ipv6 says e0aef000,
/lib/modules/2.4.15-pre1/kernel/net/ipv6/ipv6.o says
e0aec180.  Ignoring
/lib/modules/2.4.15-pre1/kernel/net/ipv6/ipv6.o entry
Warning (compare_maps): mismatch on symbol inet6_ifa_count
, ipv6 says e0aef004,
/lib/modules/2.4.15-pre1/kernel/net/ipv6/ipv6.o says
e0aec184.  Ignoring
/lib/modules/2.4.15-pre1/kernel/net/ipv6/ipv6.o entry
Warning (compare_maps): mismatch on symbol inet6_protos  ,
ipv6 says e0af1260,
/lib/modules/2.4.15-pre1/kernel/net/ipv6/ipv6.o says
e0aee3e0.  Ignoring
/lib/modules/2.4.15-pre1/kernel/net/ipv6/ipv6.o entry
Warning (compare_maps): mismatch on symbol inetsw6  , ipv6
says e0aeefa0,
/lib/modules/2.4.15-pre1/kernel/net/ipv6/ipv6.o says
e0aec120.  Ignoring
/lib/modules/2.4.15-pre1/kernel/net/ipv6/ipv6.o entry
Warning (compare_maps): mismatch on symbol ip6_ra_chain  ,
ipv6 says e0af09a0,
/lib/modules/2.4.15-pre1/kernel/net/ipv6/ipv6.o says
e0aedb20.  Ignoring
/lib/modules/2.4.15-pre1/kernel/net/ipv6/ipv6.o entry
Warning (compare_maps): mismatch on symbol ipv6_statistics
, ipv6 says e0aef1a0,
/lib/modules/2.4.15-pre1/kernel/net/ipv6/ipv6.o says
e0aec320.  Ignoring
/lib/modules/2.4.15-pre1/kernel/net/ipv6/ipv6.o entry
Warning (compare_maps): mismatch on symbol raw_v6_htable  ,
ipv6 says e0af11e0,
/lib/modules/2.4.15-pre1/kernel/net/ipv6/ipv6.o says
e0aee360.  Ignoring
/lib/modules/2.4.15-pre1/kernel/net/ipv6/ipv6.o entry
Warning (compare_maps): mismatch on symbol rt6_stats  , ipv6
says e0aef168,
/lib/modules/2.4.15-pre1/kernel/net/ipv6/ipv6.o says
e0aec2e8.  Ignoring
/lib/modules/2.4.15-pre1/kernel/net/ipv6/ipv6.o entry
Warning (compare_maps): mismatch on symbol udp_stats_in6  ,
ipv6 says e0af09e0,
/lib/modules/2.4.15-pre1/kernel/net/ipv6/ipv6.o says
e0aedb60.  Ignoring
/lib/modules/2.4.15-pre1/kernel/net/ipv6/ipv6.o entry
cpu: 0, clocks: 1335718, slice: 667859
kernel BUG at prints.c:334!
invalid operand: 0000
CPU:    0
EIP:    0010:[<c0171cbb>]    Not tainted
Using defaults from ksymoops -t elf32-i386 -a i386
EFLAGS: 00010286
eax: 0000001c   ebx: c0253580   ecx: c029eec0   edx:
00003008
esi: c1843e00   edi: cf9bfe84   ebp: cf9bfe18   esp:
cf9bfdd8
ds: 0018   es: 0018   ss: 0018
Process nfsd (pid: 2821, stackpage=cf9bf000)
Stack: c0257ca6 0000014e cf9bfe00 c02535ec c0165aa8 c1843e00
c0253580 00000000
       c02535ec cf9bfe44 000824aa 0008264e 7d9bf77f 000001f4
00000000 00000003
       d789c6e0 c0165bc0 cd5599c0 d789c73c 00000007 cf9bfe44
cf9bfe84 fffffff4
Call Trace: [<c0165aa8>] [<c0165bc0>] [<c0146b03>]
[<c013e74d>] [<c013e7fb>]
   [<e0b352be>] [<c010867b>] [<e0b3aa58>] [<e0b41fc0>]
[<e0b325a3>] [<e0b41fc0>]
   [<e0b198c5>] [<e0b419d8>] [<e0b3238f>] [<e0b419c0>]
[<c01055b4>]
Code: 0f 0b 83 c4 08 68 e0 70 31 c0 85 f6 74 0f 0f b7 46 08
50 e8

>>EIP; c0171cbb <reiserfs_panic+3b/68>   <=====
Trace; c0165aa8 <reiserfs_find_entry+88/134>
Trace; c0165bc0 <reiserfs_lookup+6c/cc>
Trace; c0146b03 <d_alloc+1b/188>
Trace; c013e74d <lookup_hash+9d/f0>
Trace; c013e7fb <lookup_one_len+5b/6c>
Trace; e0b352be <[nfsd]nfsd_lookup+32e/468>
Trace; c010867b <do_IRQ+db/ec>
Trace; e0b3aa58 <[nfsd]nfsd3_proc_lookup+d4/e0>
Trace; e0b41fc0 <[nfsd]nfsd_procedures3+60/2c0>
Trace; e0b325a3 <[nfsd]nfsd_dispatch+d3/1a0>
Trace; e0b41fc0 <[nfsd]nfsd_procedures3+60/2c0>
Trace; e0b198c5 <[sunrpc]svc_process+28d/4d8>
Trace; e0b419d8 <[nfsd]nfsd_version3+0/10>
Trace; e0b3238f <[nfsd]nfsd+207/348>
Trace; e0b419c0 <[nfsd]nfsd_list+0/0>
Trace; c01055b4 <kernel_thread+28/38>
Code;  c0171cbb <reiserfs_panic+3b/68>
00000000 <_EIP>:
Code;  c0171cbb <reiserfs_panic+3b/68>   <=====
   0:   0f 0b                     ud2a      <=====
Code;  c0171cbd <reiserfs_panic+3d/68>
   2:   83 c4 08                  add    $0x8,%esp
Code;  c0171cc0 <reiserfs_panic+40/68>
   5:   68 e0 70 31 c0            push   $0xc03170e0
Code;  c0171cc5 <reiserfs_panic+45/68>
   a:   85 f6                     test   %esi,%esi
Code;  c0171cc7 <reiserfs_panic+47/68>
   c:   74 0f                     je     1d <_EIP+0x1d>
c0171cd8 <reiserfs_panic+58/68>
Code;  c0171cc9 <reiserfs_panic+49/68>
   e:   0f b7 46 08               movzwl 0x8(%esi),%eax
Code;  c0171ccd <reiserfs_panic+4d/68>
  12:   50                        push   %eax
Code;  c0171cce <reiserfs_panic+4e/68>
  13:   e8 00 00 00 00            call   18 <_EIP+0x18>
c0171cd3 <reiserfs_panic+53/68>


16 warnings issued.  Results may not be reliable.


The kernel is

Linux atlanta 2.4.15-pre1 #2 SMP Fri Nov 16 23:08:06 CET
2001 i686 unknown

and the disk is "IBM-DTLA-307075" (is that drive known for
failures?)


Is this bug known?
Is the filesystem now in a state that requires reiserfsck,
or is it enough to delete the files whose data or metadata
touch the bad blocks?
If reiserfsck is needed, is there a ramdisk-based
distribution (kit) that lets you do the whole job over the
network?
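
To see which sectors are actually failing, the distinct
sector numbers can be pulled out of the log above. A minimal
sketch in Python, assuming the "end_request: I/O error, dev
... sector N" format shown here; the function name
failing_sectors is only illustrative, and the pattern
tolerates the line wrapping introduced by the mail client:

```python
import re
from collections import Counter

# Matches 2.4-era "end_request" lines as they appear in this log.
# \s+ is used between fields so that wrapped lines still match.
END_REQUEST = re.compile(
    r"end_request: I/O error, dev\s+(\S+)\s+\((\w+)\),\s+sector\s+(\d+)"
)


def failing_sectors(log_text):
    """Return a Counter mapping (device name, sector) to the error count."""
    counts = Counter()
    for _devnum, name, sector in END_REQUEST.findall(log_text):
        counts[(name, int(sector))] += 1
    return counts


if __name__ == "__main__":
    # Three entries copied from the log above, still wrapped.
    sample = """\
Nov 13 03:48:37 atlanta kernel: end_request: I/O error, dev
03:03 (hda), sector 935048
Nov 13 03:50:23 atlanta kernel: end_request: I/O error, dev
03:03 (hda), sector 942392
Nov 13 03:51:00 atlanta kernel: end_request: I/O error, dev
03:03 (hda), sector 942384
"""
    for (dev, sector), n in sorted(failing_sectors(sample).items()):
        print(f"{dev} sector {sector}: {n} error(s)")
```

Run over the full log, this should report three distinct
sectors: 935048, 942392 and 942384. The numbers are simply
what the kernel printed after "sector"; mapping them to
files is a separate step.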

Xuân.

--
Kind regards

Xuân Baldauf
Medium.net Internet Server Software

