Hi linux-raid,
I just got a major mdrecoveryd crash caused by a NULL pointer dereference.
The system is still up and working, though, with the RAID disk
mounted; the system messages are below. If you want any more information
from the running system, email me; I will check my mail again at 22:15.
It is a 2.2.16-raid0.90 kernel without any other changes, and the patch
applied without errors.
I configured a zip disk with 2 partitions as a raid1 autostart array
and did some mount/tar/umount/raidstop/raidstart tests.
Everything worked fine. Then I did the following:
Console 2: raidsetfaulty /dev/hdb2
Console 2: raidhotremove /dev/hdb2
Console 1: tar xvzf something totheraidarray
Console 2 while tar was running: raidhotadd /dev/hdb2
Now when I do a cat /proc/mdstat, it shows:
Personalities : [linear] [raid0] [raid1] [raid5] [translucent]
read_ahead 1024 sectors
md0 : active raid1 hdb2[2] hdb1[0] 49024 blocks [2/1] [U_] recovery=0% finish=10168.0min
unused devices: <none>
This output no longer changes, except that the finish time continuously
increases -,^((
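
One way to confirm that the resync is really stuck, rather than just slow, is to take two readings of /proc/mdstat and compare them. The following is only a minimal sketch: the helper names `parse_mdstat` and `looks_stalled` are hypothetical, the regular expression is written against the single sample quoted in this mail (other kernels may print a different layout), and the second finish value in the usage lines is made up for illustration.

```python
import re

# Sample text in the /proc/mdstat layout quoted in this mail.
SAMPLE = """\
md0 : active raid1 hdb2[2] hdb1[0] 49024 blocks [2/1] [U_] recovery=0%
finish=10168.0min
unused devices: <none>
"""

# Assumed layout: "<name> : active <level> ... [raid/working] [U_] recovery=N% finish=Mmin"
STATUS_RE = re.compile(
    r"(md\d+) : active (\w+) .*?\[(\d+)/(\d+)\] \[([U_]+)\]"
    r"(?: recovery=(\d+)% finish=([\d.]+)min)?"
)


def parse_mdstat(text):
    """Return the status fields of the first array found, or None."""
    joined = " ".join(text.split())  # undo mail/terminal line wrapping
    m = STATUS_RE.search(joined)
    if m is None:
        return None
    name, level, raid_disks, working, flags, pct, finish = m.groups()
    return {
        "name": name,
        "level": level,
        "raid_disks": int(raid_disks),
        "working_disks": int(working),
        "flags": flags,
        "recovery_pct": None if pct is None else int(pct),
        "finish_min": None if finish is None else float(finish),
    }


def looks_stalled(earlier, later):
    """True if progress did not move while the finish estimate grew."""
    return (
        earlier["recovery_pct"] is not None
        and later["recovery_pct"] == earlier["recovery_pct"]
        and later["finish_min"] > earlier["finish_min"]
    )


# Usage: the second reading is the sample with a made-up, larger estimate.
earlier = parse_mdstat(SAMPLE)
later = parse_mdstat(SAMPLE.replace("10168.0", "10200.0"))
print(looks_stalled(earlier, later))
```

On a live system the two readings would come from reading /proc/mdstat twice, some seconds apart.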
Any comments?
Bye,
Ulf
-----raid autostart ---------------8<------------------8<---------------
Sep 5 21:16:37 ulda2 kernel: VFS: Disk change detected on device
ide0(3,65)
Sep 5 21:16:37 ulda2 kernel: hdb: hdb1 hdb2
Sep 5 21:16:37 ulda2 kernel: (read) hdb1's sb offset: 49024 [events:
000000c5]
Sep 5 21:16:37 ulda2 kernel: (read) hdb2's sb offset: 49088 [events:
000000c5]
Sep 5 21:16:37 ulda2 kernel: autorun ...
Sep 5 21:16:37 ulda2 kernel: considering hdb2 ...
Sep 5 21:16:37 ulda2 kernel: adding hdb2 ...
Sep 5 21:16:37 ulda2 kernel: adding hdb1 ...
Sep 5 21:16:37 ulda2 kernel: created md0
Sep 5 21:16:37 ulda2 kernel: bind<hdb1,1>
Sep 5 21:16:37 ulda2 kernel: md0: WARNING: hdb2 appears to be on the
same physical disk as hdb1. True
Sep 5 21:16:37 ulda2 kernel: protection against single-disk
failure might be compromised.
Sep 5 21:16:37 ulda2 kernel: bind<hdb2,2>
Sep 5 21:16:37 ulda2 kernel: running: <hdb2><hdb1>
Sep 5 21:16:37 ulda2 kernel: now!
Sep 5 21:16:37 ulda2 kernel: hdb2's event counter: 000000c5
Sep 5 21:16:37 ulda2 kernel: hdb1's event counter: 000000c5
Sep 5 21:16:37 ulda2 kernel: md0: max total readahead window set to
128k
Sep 5 21:16:37 ulda2 kernel: md0: 1 data-disks, max readahead per
data-disk: 128k
Sep 5 21:16:37 ulda2 kernel: raid1: device hdb2 operational as mirror 1
Sep 5 21:16:37 ulda2 kernel: raid1: device hdb1 operational as mirror 0
Sep 5 21:16:37 ulda2 kernel: (checking disk 0)
Sep 5 21:16:37 ulda2 kernel: (really checking disk 0)
Sep 5 21:16:37 ulda2 kernel: (checking disk 1)
Sep 5 21:16:37 ulda2 kernel: (really checking disk 1)
Sep 5 21:16:37 ulda2 kernel: (checking disk 2)
Sep 5 21:16:37 ulda2 kernel: (checking disk 3)
Sep 5 21:16:37 ulda2 kernel: (checking disk 4)
Sep 5 21:16:37 ulda2 kernel: (checking disk 5)
Sep 5 21:16:37 ulda2 kernel: (checking disk 6)
Sep 5 21:16:37 ulda2 kernel: (checking disk 7)
Sep 5 21:16:37 ulda2 kernel: (checking disk 8)
Sep 5 21:16:37 ulda2 kernel: (checking disk 9)
Sep 5 21:16:37 ulda2 kernel: (checking disk 10)
Sep 5 21:16:37 ulda2 kernel: (checking disk 11)
Sep 5 21:16:37 ulda2 kernel: raid1: raid set md0 active with 2 out of 2
mirrors
Sep 5 21:16:37 ulda2 kernel: md: updating md0 RAID superblock on device
Sep 5 21:16:37 ulda2 kernel: hdb2 [events: 000000c6](write) hdb2's sb
offset: 49088
Sep 5 21:16:37 ulda2 kernel: hdb1 [events: 000000c6](write) hdb1's sb
offset: 49024
Sep 5 21:16:37 ulda2 kernel: .
Sep 5 21:16:37 ulda2 kernel: ... autorun DONE.
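
The two "sb offset" values above (49024 and 49088) are both multiples of 64. That matches my understanding of the 0.90 superblock format, where the superblock sits in the last fully aligned 64 KiB of the device, counted in 1 KiB blocks. The sketch below shows that arithmetic; the constant and function names are my own, and the two partition sizes are hypothetical, since the real sizes are not given in this mail.

```python
# Assumption: the 0.90 superblock reserves 64 KiB, counted in 1 KiB blocks.
MD_RESERVED_BLOCKS = 64


def sb_offset(device_blocks):
    """Superblock offset in 1 KiB blocks for a device of the given size.

    Rounds the size down to a multiple of 64 blocks, then steps back
    one 64-block unit.
    """
    return (device_blocks & ~(MD_RESERVED_BLOCKS - 1)) - MD_RESERVED_BLOCKS


# Hypothetical partition sizes (in 1 KiB blocks) that would yield the
# offsets seen in the log; the real sizes are not known.
for size in (49100, 49160):
    print(size, sb_offset(size))
```

Under this reading, the smaller of the two offsets (49024) is also the usable size of the mirror, which agrees with the "49024 blocks" shown in /proc/mdstat.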
----after raidsetfaulty ------------8<---------------8<---------
Sep 5 21:21:18 ulda2 kernel: raid1: Disk failure on hdb2, disabling
device.
Sep 5 21:21:18 ulda2 kernel: Operation continuing on 1 devices
Sep 5 21:21:18 ulda2 kernel: md: recovery thread got woken up ...
Sep 5 21:21:18 ulda2 kernel: md0: no spare disk to reconstruct array!
-- continuing in degraded mode
Sep 5 21:21:18 ulda2 kernel: md: recovery thread finished ...
---- after raidhotremove --------8<------------------8<------------
Sep 5 21:22:23 ulda2 kernel: trying to remove hdb2 from md0 ...
Sep 5 21:22:23 ulda2 kernel: RAID1 conf printout:
Sep 5 21:22:23 ulda2 kernel: --- wd:1 rd:2 nd:2
Sep 5 21:22:23 ulda2 kernel: disk 0, s:0, o:1, n:0 rd:0 us:1 dev:hdb1
Sep 5 21:22:23 ulda2 kernel: disk 1, s:0, o:0, n:1 rd:1 us:1 dev:hdb2
Sep 5 21:22:23 ulda2 kernel: disk 2, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:22:23 ulda2 kernel: disk 3, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:22:23 ulda2 kernel: disk 4, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:22:23 ulda2 kernel: disk 5, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:22:23 ulda2 kernel: disk 6, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:22:23 ulda2 kernel: disk 7, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:22:23 ulda2 kernel: disk 8, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:22:23 ulda2 kernel: disk 9, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:22:23 ulda2 kernel: disk 10, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:22:23 ulda2 kernel: disk 11, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:22:23 ulda2 kernel: RAID1 conf printout:
Sep 5 21:22:23 ulda2 kernel: --- wd:1 rd:2 nd:1
Sep 5 21:22:23 ulda2 kernel: disk 0, s:0, o:1, n:0 rd:0 us:1 dev:hdb1
Sep 5 21:22:23 ulda2 kernel: disk 1, s:0, o:0, n:1 rd:1 us:0 dev:[dev
00:00]
Sep 5 21:22:23 ulda2 kernel: disk 2, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:22:23 ulda2 kernel: disk 3, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:22:23 ulda2 kernel: disk 4, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:22:23 ulda2 kernel: disk 5, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:22:23 ulda2 kernel: disk 6, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:22:23 ulda2 kernel: disk 7, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:22:23 ulda2 kernel: disk 8, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:22:23 ulda2 kernel: disk 9, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:22:23 ulda2 kernel: disk 10, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:22:23 ulda2 kernel: disk 11, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:22:23 ulda2 kernel: unbind<hdb2,1>
Sep 5 21:22:23 ulda2 kernel: export_rdev(hdb2)
Sep 5 21:22:23 ulda2 kernel: md: updating md0 RAID superblock on device
Sep 5 21:22:23 ulda2 kernel: hdb1 [events: 000000c7](write) hdb1's sb
offset: 49024
Sep 5 21:22:25 ulda2 kernel: .
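
The "RAID1 conf printout" blocks are hard to read because ten of the twelve slots are empty. A small filter that keeps only the used slots makes the state changes easier to follow. This is a sketch only: the function name `used_slots` is hypothetical, and the field meanings in the comments (s = spare, o = operational, n = number, rd = raid disk, us = used slot) are my reading of the abbreviations, not something stated in the log.

```python
import re

# A few lines copied from the first printout above, including the mail wrapping.
SAMPLE = """\
Sep 5 21:22:23 ulda2 kernel: disk 0, s:0, o:1, n:0 rd:0 us:1 dev:hdb1
Sep 5 21:22:23 ulda2 kernel: disk 1, s:0, o:0, n:1 rd:1 us:1 dev:hdb2
Sep 5 21:22:23 ulda2 kernel: disk 2, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
"""

# Assumed meanings: s=spare, o=operational, n=number, rd=raid_disk, us=used_slot.
DISK_RE = re.compile(
    r"disk (\d+), s:(\d+), o:(\d+), n:(\d+) rd:(\d+) us:(\d+) "
    r"dev:(\[dev [^\]]*\]|\S+)"
)


def used_slots(log_text):
    """Return one dict per slot whose 'us' flag is set."""
    joined = " ".join(log_text.split())  # rejoin wrapped "[dev 00:00]" entries
    slots = []
    for m in DISK_RE.finditer(joined):
        slot, spare, oper, number, raid_disk, used, dev = m.groups()
        if int(used):
            slots.append({
                "slot": int(slot),
                "spare": bool(int(spare)),
                "operational": bool(int(oper)),
                "number": int(number),
                "raid_disk": int(raid_disk),
                "dev": dev,
            })
    return slots


for entry in used_slots(SAMPLE):
    print(entry)
```

Applied to the sample, this leaves only the hdb1 and hdb2 entries, with hdb2 already marked as not operational after raidsetfaulty.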
--------- after raidhotadd --------8<------------------8<---------
Sep 5 21:23:41 ulda2 kernel: trying to hot-add hdb2 to md0 ...
Sep 5 21:23:41 ulda2 kernel: md0: WARNING: hdb2 appears to be on the
same physical disk as hdb1. True
Sep 5 21:23:41 ulda2 kernel: protection against single-disk
failure might be compromised.
Sep 5 21:23:41 ulda2 kernel: bind<hdb2,2>
Sep 5 21:23:41 ulda2 kernel: RAID1 conf printout:
Sep 5 21:23:41 ulda2 kernel: --- wd:1 rd:2 nd:1
Sep 5 21:23:41 ulda2 kernel: disk 0, s:0, o:1, n:0 rd:0 us:1 dev:hdb1
Sep 5 21:23:41 ulda2 kernel: disk 1, s:0, o:0, n:1 rd:1 us:0 dev:[dev
00:00]
Sep 5 21:23:41 ulda2 kernel: disk 2, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:23:41 ulda2 kernel: disk 3, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:23:41 ulda2 kernel: disk 4, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:23:41 ulda2 kernel: disk 5, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:23:41 ulda2 kernel: disk 6, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:23:41 ulda2 kernel: disk 7, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:23:41 ulda2 kernel: disk 8, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:23:41 ulda2 kernel: disk 9, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:23:41 ulda2 kernel: disk 10, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:23:41 ulda2 kernel: disk 11, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:23:41 ulda2 kernel: RAID1 conf printout:
Sep 5 21:23:41 ulda2 kernel: --- wd:1 rd:2 nd:2
Sep 5 21:23:41 ulda2 kernel: disk 0, s:0, o:1, n:0 rd:0 us:1 dev:hdb1
Sep 5 21:23:41 ulda2 kernel: disk 1, s:0, o:0, n:1 rd:1 us:0 dev:[dev
00:00]
Sep 5 21:23:41 ulda2 kernel: disk 2, s:1, o:0, n:2 rd:2 us:1 dev:hdb2
Sep 5 21:23:41 ulda2 kernel: disk 3, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:23:41 ulda2 kernel: disk 4, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:23:41 ulda2 kernel: disk 5, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:23:41 ulda2 kernel: disk 6, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:23:41 ulda2 kernel: disk 7, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:23:41 ulda2 kernel: disk 8, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:23:41 ulda2 kernel: disk 9, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:23:41 ulda2 kernel: disk 10, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:23:41 ulda2 kernel: disk 11, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:23:41 ulda2 kernel: md: updating md0 RAID superblock on device
Sep 5 21:23:41 ulda2 kernel: hdb2 [events: 000000c8](write) hdb2's sb
offset: 49088
Sep 5 21:23:41 ulda2 kernel: hdb1 [events: 000000c8](write) hdb1's sb
offset: 49024
Sep 5 21:23:42 ulda2 kernel: .
Sep 5 21:23:42 ulda2 kernel: md: recovery thread got woken up ...
Sep 5 21:23:42 ulda2 kernel: md0: resyncing spare disk hdb2 to replace
failed disk
Sep 5 21:23:42 ulda2 kernel: RAID1 conf printout:
Sep 5 21:23:42 ulda2 kernel: --- wd:1 rd:2 nd:2
Sep 5 21:23:42 ulda2 kernel: disk 0, s:0, o:1, n:0 rd:0 us:1 dev:hdb1
Sep 5 21:23:42 ulda2 kernel: disk 1, s:0, o:0, n:1 rd:1 us:0 dev:[dev
00:00]
Sep 5 21:23:42 ulda2 kernel: disk 2, s:1, o:0, n:2 rd:2 us:1 dev:hdb2
Sep 5 21:23:42 ulda2 kernel: disk 3, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:23:42 ulda2 kernel: disk 4, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:23:42 ulda2 kernel: disk 5, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:23:42 ulda2 kernel: disk 6, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:23:42 ulda2 kernel: disk 7, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:23:42 ulda2 kernel: disk 8, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:23:42 ulda2 kernel: disk 9, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:23:42 ulda2 kernel: disk 10, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:23:42 ulda2 kernel: disk 11, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:23:42 ulda2 kernel: RAID1 conf printout:
Sep 5 21:23:42 ulda2 kernel: --- wd:1 rd:2 nd:2
Sep 5 21:23:42 ulda2 kernel: disk 0, s:0, o:1, n:0 rd:0 us:1 dev:hdb1
Sep 5 21:23:42 ulda2 kernel: disk 1, s:0, o:0, n:1 rd:1 us:0 dev:[dev
00:00]
Sep 5 21:23:42 ulda2 kernel: disk 2, s:1, o:1, n:2 rd:2 us:1 dev:hdb2
Sep 5 21:23:42 ulda2 kernel: disk 3, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:23:42 ulda2 kernel: disk 4, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:23:42 ulda2 kernel: disk 5, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:23:42 ulda2 kernel: disk 6, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:23:42 ulda2 kernel: disk 7, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:23:42 ulda2 kernel: disk 8, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:23:42 ulda2 kernel: disk 9, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:23:42 ulda2 kernel: disk 10, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:23:42 ulda2 kernel: disk 11, s:0, o:0, n:0 rd:0 us:0 dev:[dev
00:00]
Sep 5 21:23:42 ulda2 kernel: md: syncing RAID array md0
Sep 5 21:23:42 ulda2 kernel: md: minimum _guaranteed_ reconstruction
speed: 100 KB/sec.
Sep 5 21:23:42 ulda2 kernel: md: using maximum available idle IO
bandwith for reconstruction.
Sep 5 21:23:42 ulda2 kernel: md: using 128k window.
---- BANG! ------------ 8-( --------------- 8-( ----------------
Sep 5 21:23:42 ulda2 kernel: Unable to handle kernel NULL pointer
dereference at virtual address 00000000
Sep 5 21:23:42 ulda2 kernel: current->tss.cr3 = 00101000, %cr3 =
00101000
Sep 5 21:23:42 ulda2 kernel: *pde = 00000000
Sep 5 21:23:42 ulda2 kernel: Oops: 0000
Sep 5 21:23:42 ulda2 kernel: CPU: 0
Sep 5 21:23:42 ulda2 kernel: EIP: 0010:[<00000000>]
Sep 5 21:23:42 ulda2 kernel: EFLAGS: 00010002
Sep 5 21:23:42 ulda2 kernel: eax: 00000000 ebx: 00000246 ecx:
00000001 edx: c0272d00
Sep 5 21:23:42 ulda2 kernel: esi: 00000080 edi: c694a000 ebp:
00000000 esp: c7dd7f24
Sep 5 21:23:42 ulda2 kernel: ds: 0018 es: 0018 ss: 0018
Sep 5 21:23:42 ulda2 kernel: Process mdrecoveryd (pid: 6, process nr:
6, stackpage=c7dd7000)
Sep 5 21:23:42 ulda2 kernel: Stack: c0192ce2 c0272d00 c7daf1e0 c4f41000
c7dd7fd4 c7daf22c 00000080 00000002
Sep 5 21:23:42 ulda2 kernel: c7daf1e0 c4f41000 c7dd6000 c7dd6000
00000004 c7dd6000 00000000 c02415e0
Sep 5 21:23:42 ulda2 kernel: 00000000 0000007f 00090000 00000001
c7dd6000 00000024 00000900 c7daf238
Sep 5 21:23:42 ulda2 kernel: Call Trace: [md_do_sync+1134/2700]
[md_do_recovery+234/580] [md_thread+167/316] [kernel_thread+35/48]
Sep 5 21:23:42 ulda2 kernel: Code: Bad EIP value.
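
For anyone reading the oops: EIP is 00000000 and the code dump says "Bad EIP value", which would be consistent with a call through a NULL function pointer, and the innermost entry in the call trace is md_do_sync. The sketch below only splits the already symbolized "Call Trace" line into (symbol, offset, size) tuples so the offsets can be looked up in a disassembly; the name `parse_call_trace` is hypothetical, and the pattern assumes the "[symbol+offset/size]" form shown above.

```python
import re

# The call trace exactly as quoted above, including the line wrap.
SAMPLE = """\
Sep 5 21:23:42 ulda2 kernel: Call Trace: [md_do_sync+1134/2700]
[md_do_recovery+234/580] [md_thread+167/316] [kernel_thread+35/48]
"""

# Matches entries of the form [symbol+offset/size], decimal numbers assumed.
TRACE_RE = re.compile(r"\[(\w+)\+(\d+)/(\d+)\]")


def parse_call_trace(text):
    """Return (symbol, offset, size) tuples, innermost frame first."""
    return [
        (symbol, int(offset), int(size))
        for symbol, offset, size in TRACE_RE.findall(text)
    ]


for symbol, offset, size in parse_call_trace(SAMPLE):
    print(f"{symbol}: offset {offset} of {size} bytes")
```

The first tuple (md_do_sync, offset 1134) is the place to start looking for the indirect call that ended up at address 0.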