Hi!

We made a test that disconnected a disk of a RAID1. During that time the Raid1 
monitor tried a recovery. Unfortunately that recovery caused the disk to be 
marked as "faulty spare", which in turn prevented later recoveries when the 
disk was actually back:

                -a $TRY_READD -eq 1 -a $OCF_CHECK_LEVEL -gt 0 ]; then
                ocf_log info "Attempting recovery sequence to re-add devices on\
 $MDDEV:"
                $MDADM $MDDEV --fail detached
                $MDADM $MDDEV --remove failed
                $MDADM $MDDEV --re-add missing
                # TODO: At this stage, there's nothing to actually do
                # here. Either this worked or it did not.


RAID details in bad state:
[...]
    Number   Major   Minor   RaidDevice State
       0       0        0        0      removed
       1     253       13        1      active sync   /dev/dm-13

       0     253       27        -      faulty spare   /dev/dm-27

RAID details after fixing it:
[...]
    Number   Major   Minor   RaidDevice State
       0     253       27        0      active sync   /dev/dm-27
       1     253       13        1      active sync   /dev/dm-13

I wonder whether just a "--re-add missing" is enough for a recovery.

Regards,
Ulrich


_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Reply via email to