Hi!
We made a test that disconnected a disk of a RAID1. During that time the Raid1
monitor tried a recovery. Unfortunately that recovery caused the disk to be
marked as "faulty spare", which in turn prevented later recoveries when the
disk was actually back:
-a $TRY_READD -eq 1 -a $OCF_CHECK_LEVEL -gt 0 ]; then
ocf_log info "Attempting recovery sequence to re-add devices on\
$MDDEV:"
$MDADM $MDDEV --fail detached
$MDADM $MDDEV --remove failed
$MDADM $MDDEV --re-add missing
# TODO: At this stage, there's nothing to actually do
# here. Either this worked or it did not.
RAID details in bad state:
[...]
Number Major Minor RaidDevice State
0 0 0 0 removed
1 253 13 1 active sync /dev/dm-13
0 253 27 - faulty spare /dev/dm-27
RAID details after fixing it:
[...]
Number Major Minor RaidDevice State
0 253 27 0 active sync /dev/dm-27
1 253 13 1 active sync /dev/dm-13
I wonder whether just a "--re-add missing" is enough for a recovery.
Regards,
Ulrich
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems