Public bug reported:
Binary package hint: udev
My system was correctly booting from a mirrored PATA (via libata) RAID 1
until one of the disks was removed. I then hit the "degraded mode
doesn't boot" problem for a while. This all happened under Gutsy (kept
up to date daily.)
After that I replaced the missing disk, while the system was powered off, and
then:
* booted with 'break=mount' on the kernel command line.
* waited until detection of hardware had completed.
* ran 'exec mdadm -As' to detect the degraded RAID and continue to boot
* ran 'sfdisk -d /dev/sda | sfdisk /dev/sdb' to partition the new disk
* ran 'mdadm -a /dev/md0 /dev/sdb1'
* ran 'mdadm -a /dev/md1 /dev/sdb2'
* waited for the system to resync the data
* checked that /proc/mdstat showed all healthy
* rebooted
At this point I expected, naturally, to have the system boot cleanly
without any problems or delay. Instead the system simply ground to a
halt after the three minute boot timeout without the RAID detected.
After some investigation this looks, to me, like a problem with
identification of the use of the device components.
The udev rules for mdadm depend on ENV{ID_FS_TYPE}=="linux_raid*" to run
mdadm at all; for my RAID components I get the following details:
UDEV [1187654052.620855] add /block/sda/sda1 (block)
UDEV_LOG=3
ACTION=add
DEVPATH=/block/sda/sda1
SUBSYSTEM=block
SEQNUM=1750
MINOR=1
MAJOR=8
PHYSDEVPATH=/devices/pci0000:00/0000:00:1f.1/host0/target0:0:0/0:0:0:0
PHYSDEVBUS=scsi
PHYSDEVDRIVER=sd
UDEVD_EVENT=1
DEVTYPE=partition
ID_VENDOR=ATA
ID_MODEL=SAMSUNG_MP0804H
ID_REVISION=UE10
ID_SERIAL=1ATA_SAMSUNG_MP0804H_S042J10Y257961
ID_SERIAL_SHORT=ATA_SAMSUNG_MP0804H_S042J10Y257961
ID_TYPE=disk
ID_BUS=scsi
ID_ATA_COMPAT=SAMSUNG_MP0804H_S042J10Y257961
ID_PATH=pci-0000:00:1f.1-scsi-0:0:0:0
ID_FS_USAGE=filesystem
ID_FS_TYPE=ext3
ID_FS_VERSION=1.0
ID_FS_UUID=f93b5509-6e68-4f2f-9d2f-fcff7a2dfb19
ID_FS_UUID_ENC=f93b5509-6e68-4f2f-9d2f-fcff7a2dfb19
ID_FS_LABEL=enki-root
ID_FS_LABEL_ENC=enki-root
ID_FS_LABEL_SAFE=enki-root
DEVNAME=/dev/sda1
DEVLINKS=/dev/disk/by-id/scsi-1ATA_SAMSUNG_MP0804H_S042J10Y257961-part1
/dev/disk/by-id/ata-SAMSUNG_MP0804H_S042J10Y257961-part1
/dev/disk/by-path/pci-0000:00:1f.1-scsi-0:0:0:0-part1
/dev/disk/by-uuid/f93b5509-6e68-4f2f-9d2f-fcff7a2dfb19
/dev/disk/by-label/enki-root
Note that the 'ID_FS_TYPE' value is ext3, the file system in the RAID
array, rather than identifying this disk as part of a RAID array.
The same misidentification is present for the swap RAID1 and the other
component; I can supply logs showing that if it matters.
The RAID array itself is a healthy RAID1 with version 1.0 metadata:
[EMAIL PROTECTED]:~$ cat /proc/mdstat
Personalities : [linear] [multipath] [raid0] [raid1] [raid6] [raid5] [raid4]
[raid10]
md1 : active raid1 sda2[0] sdb2[1]
1975984 blocks super 1.0 [2/2] [UU]
md0 : active raid1 sdb1[0] sda1[2]
76204224 blocks super 1.0 [2/2] [UU]
unused devices: <none>
[EMAIL PROTECTED]:~$ sudo mdadm -D /dev/md0
[sudo] password for daniel:
/dev/md0:
Version : 01.00.03
Creation Time : Wed May 16 01:08:07 2007
Raid Level : raid1
Array Size : 76204224 (72.67 GiB 78.03 GB)
Used Dev Size : 152408448 (72.67 GiB 78.03 GB)
Raid Devices : 2
Total Devices : 2
Preferred Minor : 0
Persistence : Superblock is persistent
Update Time : Tue Aug 21 10:21:04 2007
State : active
Active Devices : 2
Working Devices : 2
Failed Devices : 0
Spare Devices : 0
Name : enki-root
UUID : 3a6b05ca:c218089c:f9df5f76:731157e8
Events : 802619
Number Major Minor RaidDevice State
0 8 17 0 active sync /dev/sdb1
2 8 1 1 active sync /dev/sda1
However, one very odd factor in this:
[EMAIL PROTECTED]:~$ sudo mdadm -E /dev/sda1
/dev/sda1:
Magic : a92b4efc
Version : 01
Feature Map : 0x0
Array UUID : 3a6b05ca:c218089c:f9df5f76:731157e8
Name : enki-root
Creation Time : Wed May 16 01:08:07 2007
Raid Level : raid1
Raid Devices : 2
Used Dev Size : 152408448 (72.67 GiB 78.03 GB)
Array Size : 152408448 (72.67 GiB 78.03 GB)
Super Offset : 152408576 sectors
State : active
Device UUID : fc1de0a0:88071249:b753809d:3cd1beee
Update Time : Tue Aug 21 10:21:24 2007
Checksum : 655ae10b - correct
Events : 802619
Array Slot : 2 (0, failed, 1)
Array State : uU 1 failed
[EMAIL PROTECTED]:~$ sudo mdadm -E /dev/sdb1
/dev/sdb1:
Magic : a92b4efc
Version : 01
Feature Map : 0x0
Array UUID : 3a6b05ca:c218089c:f9df5f76:731157e8
Name : enki-root
Creation Time : Wed May 16 01:08:07 2007
Raid Level : raid1
Raid Devices : 2
Used Dev Size : 152408448 (72.67 GiB 78.03 GB)
Array Size : 152408448 (72.67 GiB 78.03 GB)
Super Offset : 152408576 sectors
State : active
Device UUID : 71575927:ef34548e:5693969d:1b8f6631
Update Time : Tue Aug 21 10:21:26 2007
Checksum : 73d44564 - correct
Events : 802619
Array Slot : 0 (0, failed, 1)
Array State : Uu 1 failed
It looks like the MD device has a happy RAID1 header but the individual
components have metadata that indicates that /both/ of them are part of a
failed RAID array.
In any case the misidentification of the component use means that mdadm
is never called on my system.
Please let me know if I can assist in debugging this further or in
providing any specific testing. I have no concerns building whatever
tools with debugging, etc, as needed.
** Affects: udev (Ubuntu)
Importance: Undecided
Status: New
--
[gutsy] partitions no longer detected as RAID components after repairing
degraded RAID 1 mirror
https://bugs.launchpad.net/bugs/133773
You received this bug notification because you are a member of Ubuntu
Bugs, which is the bug contact for Ubuntu.
--
ubuntu-bugs mailing list
[email protected]
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs