Hi at linux-raid

I just happend to get a major recoveryd crash by NULLPOINTER dereference
The system ist up and working though, with the raid-disk
mounted system messages below. So if you want any more information from
the running system, email me, I will look at the mails on 22:15 again.

It is a 2.2.16-raid0.90 without any other changes or patch errors
I configured a zip disk with 2 partititions as raid1-autostart array,
did some mounting/taring/unmounting/raidstop/raidstart tests.
Everything worked fine. Then I did  the following:

Console 2: raidsetfaulty /dev/hdb2
Console 2: raidhotremove /dev/hdb2
Console 1: tar xvzf something totheraidarray
Console 2 while tar was running: raidhotadd /dev/hdb2

now when I do a cat /proc/mdstat it shows:

Personalities : [linear] [raid0] [raid1] [raid5] [translucent] 
read_ahead 1024 sectors
md0 : active raid1 hdb2[2] hdb1[0] 49024 blocks [2/1] [U_] recovery=0%
finish=10168.0min
unused devices: <none>

this message doesn't change anymore expect the finish time continuesly
increases -,^((

any comments? 

bye
        Ulf


-----raid autostart ---------------8<------------------8<---------------

Sep  5 21:16:37 ulda2 kernel: VFS: Disk change detected on device
ide0(3,65)
Sep  5 21:16:37 ulda2 kernel:  hdb: hdb1 hdb2
Sep  5 21:16:37 ulda2 kernel: (read) hdb1's sb offset: 49024 [events:
000000c5]
Sep  5 21:16:37 ulda2 kernel: (read) hdb2's sb offset: 49088 [events:
000000c5]
Sep  5 21:16:37 ulda2 kernel: autorun ...
Sep  5 21:16:37 ulda2 kernel: considering hdb2 ...
Sep  5 21:16:37 ulda2 kernel:   adding hdb2 ...
Sep  5 21:16:37 ulda2 kernel:   adding hdb1 ...
Sep  5 21:16:37 ulda2 kernel: created md0
Sep  5 21:16:37 ulda2 kernel: bind<hdb1,1>
Sep  5 21:16:37 ulda2 kernel: md0: WARNING: hdb2 appears to be on the
same physical disk as hdb1. True
Sep  5 21:16:37 ulda2 kernel:      protection against single-disk
failure might be compromised.
Sep  5 21:16:37 ulda2 kernel: bind<hdb2,2>
Sep  5 21:16:37 ulda2 kernel: running: <hdb2><hdb1>
Sep  5 21:16:37 ulda2 kernel: now!
Sep  5 21:16:37 ulda2 kernel: hdb2's event counter: 000000c5
Sep  5 21:16:37 ulda2 kernel: hdb1's event counter: 000000c5
Sep  5 21:16:37 ulda2 kernel: md0: max total readahead window set to
128k
Sep  5 21:16:37 ulda2 kernel: md0: 1 data-disks, max readahead per
data-disk: 128k
Sep  5 21:16:37 ulda2 kernel: raid1: device hdb2 operational as mirror 1
Sep  5 21:16:37 ulda2 kernel: raid1: device hdb1 operational as mirror 0
Sep  5 21:16:37 ulda2 kernel: (checking disk 0)
Sep  5 21:16:37 ulda2 kernel: (really checking disk 0)
Sep  5 21:16:37 ulda2 kernel: (checking disk 1)
Sep  5 21:16:37 ulda2 kernel: (really checking disk 1)
Sep  5 21:16:37 ulda2 kernel: (checking disk 2)
Sep  5 21:16:37 ulda2 kernel: (checking disk 3)
Sep  5 21:16:37 ulda2 kernel: (checking disk 4)
Sep  5 21:16:37 ulda2 kernel: (checking disk 5)
Sep  5 21:16:37 ulda2 kernel: (checking disk 6)
Sep  5 21:16:37 ulda2 kernel: (checking disk 7)
Sep  5 21:16:37 ulda2 kernel: (checking disk 8)
Sep  5 21:16:37 ulda2 kernel: (checking disk 9)
Sep  5 21:16:37 ulda2 kernel: (checking disk 10)
Sep  5 21:16:37 ulda2 kernel: (checking disk 11)
Sep  5 21:16:37 ulda2 kernel: raid1: raid set md0 active with 2 out of 2
mirrors
Sep  5 21:16:37 ulda2 kernel: md: updating md0 RAID superblock on device
Sep  5 21:16:37 ulda2 kernel: hdb2 [events: 000000c6](write) hdb2's sb
offset: 49088
Sep  5 21:16:37 ulda2 kernel: hdb1 [events: 000000c6](write) hdb1's sb
offset: 49024
Sep  5 21:16:37 ulda2 kernel: .
Sep  5 21:16:37 ulda2 kernel: ... autorun DONE.

----after raidsetfaulty ------------8<---------------8<---------

Sep  5 21:21:18 ulda2 kernel: raid1: Disk failure on hdb2, disabling
device. 
Sep  5 21:21:18 ulda2 kernel:        Operation continuing on 1 devices
Sep  5 21:21:18 ulda2 kernel: md: recovery thread got woken up ...
Sep  5 21:21:18 ulda2 kernel: md0: no spare disk to reconstruct array!
-- continuing in degraded mode
Sep  5 21:21:18 ulda2 kernel: md: recovery thread finished ...

---- after raidhotremove --------8<------------------8<------------

Sep  5 21:22:23 ulda2 kernel: trying to remove hdb2 from md0 ... 
Sep  5 21:22:23 ulda2 kernel: RAID1 conf printout:
Sep  5 21:22:23 ulda2 kernel:  --- wd:1 rd:2 nd:2
Sep  5 21:22:23 ulda2 kernel:  disk 0, s:0, o:1, n:0 rd:0 us:1 dev:hdb1
Sep  5 21:22:23 ulda2 kernel:  disk 1, s:0, o:0, n:1 rd:1 us:1 dev:hdb2
Sep  5 21:22:23 ulda2 kernel:  disk 2, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:22:23 ulda2 kernel:  disk 3, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:22:23 ulda2 kernel:  disk 4, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:22:23 ulda2 kernel:  disk 5, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:22:23 ulda2 kernel:  disk 6, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:22:23 ulda2 kernel:  disk 7, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:22:23 ulda2 kernel:  disk 8, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:22:23 ulda2 kernel:  disk 9, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:22:23 ulda2 kernel:  disk 10, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:22:23 ulda2 kernel:  disk 11, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:22:23 ulda2 kernel: RAID1 conf printout:
Sep  5 21:22:23 ulda2 kernel:  --- wd:1 rd:2 nd:1
Sep  5 21:22:23 ulda2 kernel:  disk 0, s:0, o:1, n:0 rd:0 us:1 dev:hdb1
Sep  5 21:22:23 ulda2 kernel:  disk 1, s:0, o:0, n:1 rd:1 us:0 dev:[dev
00:00]
Sep  5 21:22:23 ulda2 kernel:  disk 2, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:22:23 ulda2 kernel:  disk 3, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:22:23 ulda2 kernel:  disk 4, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:22:23 ulda2 kernel:  disk 5, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:22:23 ulda2 kernel:  disk 6, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:22:23 ulda2 kernel:  disk 7, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:22:23 ulda2 kernel:  disk 8, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:22:23 ulda2 kernel:  disk 9, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:22:23 ulda2 kernel:  disk 10, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:22:23 ulda2 kernel:  disk 11, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:22:23 ulda2 kernel: unbind<hdb2,1>
Sep  5 21:22:23 ulda2 kernel: export_rdev(hdb2)
Sep  5 21:22:23 ulda2 kernel: md: updating md0 RAID superblock on device
Sep  5 21:22:23 ulda2 kernel: hdb1 [events: 000000c7](write) hdb1's sb
offset: 49024
Sep  5 21:22:25 ulda2 kernel: .

--------- after raidhotadd --------8<------------------8<---------

Sep  5 21:23:41 ulda2 kernel: trying to hot-add hdb2 to md0 ... 
Sep  5 21:23:41 ulda2 kernel: md0: WARNING: hdb2 appears to be on the
same physical disk as hdb1. True
Sep  5 21:23:41 ulda2 kernel:      protection against single-disk
failure might be compromised.
Sep  5 21:23:41 ulda2 kernel: bind<hdb2,2>
Sep  5 21:23:41 ulda2 kernel: RAID1 conf printout:
Sep  5 21:23:41 ulda2 kernel:  --- wd:1 rd:2 nd:1
Sep  5 21:23:41 ulda2 kernel:  disk 0, s:0, o:1, n:0 rd:0 us:1 dev:hdb1
Sep  5 21:23:41 ulda2 kernel:  disk 1, s:0, o:0, n:1 rd:1 us:0 dev:[dev
00:00]
Sep  5 21:23:41 ulda2 kernel:  disk 2, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:23:41 ulda2 kernel:  disk 3, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:23:41 ulda2 kernel:  disk 4, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:23:41 ulda2 kernel:  disk 5, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:23:41 ulda2 kernel:  disk 6, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:23:41 ulda2 kernel:  disk 7, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:23:41 ulda2 kernel:  disk 8, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:23:41 ulda2 kernel:  disk 9, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:23:41 ulda2 kernel:  disk 10, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:23:41 ulda2 kernel:  disk 11, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:23:41 ulda2 kernel: RAID1 conf printout:
Sep  5 21:23:41 ulda2 kernel:  --- wd:1 rd:2 nd:2
Sep  5 21:23:41 ulda2 kernel:  disk 0, s:0, o:1, n:0 rd:0 us:1 dev:hdb1
Sep  5 21:23:41 ulda2 kernel:  disk 1, s:0, o:0, n:1 rd:1 us:0 dev:[dev
00:00]
Sep  5 21:23:41 ulda2 kernel:  disk 2, s:1, o:0, n:2 rd:2 us:1 dev:hdb2
Sep  5 21:23:41 ulda2 kernel:  disk 3, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:23:41 ulda2 kernel:  disk 4, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:23:41 ulda2 kernel:  disk 5, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:23:41 ulda2 kernel:  disk 6, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:23:41 ulda2 kernel:  disk 7, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:23:41 ulda2 kernel:  disk 8, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:23:41 ulda2 kernel:  disk 9, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:23:41 ulda2 kernel:  disk 10, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:23:41 ulda2 kernel:  disk 11, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:23:41 ulda2 kernel: md: updating md0 RAID superblock on device
Sep  5 21:23:41 ulda2 kernel: hdb2 [events: 000000c8](write) hdb2's sb
offset: 49088
Sep  5 21:23:41 ulda2 kernel: hdb1 [events: 000000c8](write) hdb1's sb
offset: 49024
Sep  5 21:23:42 ulda2 kernel: .
Sep  5 21:23:42 ulda2 kernel: md: recovery thread got woken up ...
Sep  5 21:23:42 ulda2 kernel: md0: resyncing spare disk hdb2 to replace
failed disk
Sep  5 21:23:42 ulda2 kernel: RAID1 conf printout:
Sep  5 21:23:42 ulda2 kernel:  --- wd:1 rd:2 nd:2
Sep  5 21:23:42 ulda2 kernel:  disk 0, s:0, o:1, n:0 rd:0 us:1 dev:hdb1
Sep  5 21:23:42 ulda2 kernel:  disk 1, s:0, o:0, n:1 rd:1 us:0 dev:[dev
00:00]
Sep  5 21:23:42 ulda2 kernel:  disk 2, s:1, o:0, n:2 rd:2 us:1 dev:hdb2
Sep  5 21:23:42 ulda2 kernel:  disk 3, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:23:42 ulda2 kernel:  disk 4, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:23:42 ulda2 kernel:  disk 5, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:23:42 ulda2 kernel:  disk 6, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:23:42 ulda2 kernel:  disk 7, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:23:42 ulda2 kernel:  disk 8, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:23:42 ulda2 kernel:  disk 9, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:23:42 ulda2 kernel:  disk 10, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:23:42 ulda2 kernel:  disk 11, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:23:42 ulda2 kernel: RAID1 conf printout:
Sep  5 21:23:42 ulda2 kernel:  --- wd:1 rd:2 nd:2
Sep  5 21:23:42 ulda2 kernel:  disk 0, s:0, o:1, n:0 rd:0 us:1 dev:hdb1
Sep  5 21:23:42 ulda2 kernel:  disk 1, s:0, o:0, n:1 rd:1 us:0 dev:[dev
00:00]
Sep  5 21:23:42 ulda2 kernel:  disk 2, s:1, o:1, n:2 rd:2 us:1 dev:hdb2
Sep  5 21:23:42 ulda2 kernel:  disk 3, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:23:42 ulda2 kernel:  disk 4, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:23:42 ulda2 kernel:  disk 5, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:23:42 ulda2 kernel:  disk 6, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:23:42 ulda2 kernel:  disk 7, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:23:42 ulda2 kernel:  disk 8, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:23:42 ulda2 kernel:  disk 9, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:23:42 ulda2 kernel:  disk 10, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:23:42 ulda2 kernel:  disk 11, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep  5 21:23:42 ulda2 kernel: md: syncing RAID array md0
Sep  5 21:23:42 ulda2 kernel: md: minimum _guaranteed_ reconstruction
speed: 100 KB/sec.
Sep  5 21:23:42 ulda2 kernel: md: using maximum available idle IO
bandwith for reconstruction.
Sep  5 21:23:42 ulda2 kernel: md: using 128k window.

---- BANG! ------------ 8-( --------------- 8-( ----------------

Sep  5 21:23:42 ulda2 kernel: Unable to handle kernel NULL pointer
dereference at virtual address 00000000
Sep  5 21:23:42 ulda2 kernel: current->tss.cr3 = 00101000, %cr3 =
00101000
Sep  5 21:23:42 ulda2 kernel: *pde = 00000000
Sep  5 21:23:42 ulda2 kernel: Oops: 0000
Sep  5 21:23:42 ulda2 kernel: CPU:    0
Sep  5 21:23:42 ulda2 kernel: EIP:    0010:[<00000000>]
Sep  5 21:23:42 ulda2 kernel: EFLAGS: 00010002
Sep  5 21:23:42 ulda2 kernel: eax: 00000000   ebx: 00000246   ecx:
00000001   edx: c0272d00
Sep  5 21:23:42 ulda2 kernel: esi: 00000080   edi: c694a000   ebp:
00000000   esp: c7dd7f24
Sep  5 21:23:42 ulda2 kernel: ds: 0018   es: 0018   ss: 0018
Sep  5 21:23:42 ulda2 kernel: Process mdrecoveryd (pid: 6, process nr:
6, stackpage=c7dd7000)
Sep  5 21:23:42 ulda2 kernel: Stack: c0192ce2 c0272d00 c7daf1e0 c4f41000
c7dd7fd4 c7daf22c 00000080 00000002 
Sep  5 21:23:42 ulda2 kernel:        c7daf1e0 c4f41000 c7dd6000 c7dd6000
00000004 c7dd6000 00000000 c02415e0 
Sep  5 21:23:42 ulda2 kernel:        00000000 0000007f 00090000 00000001
c7dd6000 00000024 00000900 c7daf238 
Sep  5 21:23:42 ulda2 kernel: Call Trace: [md_do_sync+1134/2700]
[md_do_recovery+234/580] [md_thread+167/316] [kernel_thread+35/48] 
Sep  5 21:23:42 ulda2 kernel: Code: Bad EIP value.
-
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to [EMAIL PROTECTED]

Reply via email to