Hi all,

Although this will start a bit OT, I'm going to get on-topic in a bit.

A quick look through the logs on one of our R200s has flagged what looks
like a failing HD in our RAID-1 mirrored pair: 
Essentially I'm seeing several entries like this:

Aug  1 16:12:13 vzbeta kernel: mptbase: ioc0: RAID STATUS CHANGE for
PhysDisk 0 id=1
Aug  1 16:12:13 vzbeta kernel: mptbase: ioc0:   PhysDisk is now missing
Aug  1 16:12:13 vzbeta kernel: mptbase: ioc0: RAID STATUS CHANGE for
PhysDisk 0 id=1
Aug  1 16:12:15 vzbeta kernel: mptbase: ioc0:   PhysDisk is now missing, out
of sync
Aug  1 16:12:19 vzbeta kernel: mptbase: ioc0: RAID STATUS CHANGE for
VolumeID 0
Aug  1 16:12:19 vzbeta kernel: mptbase: ioc0:   volume is now degraded,
enabled
Aug  1 16:12:19 vzbeta kernel: mptbase: ioc0: RAID STATUS CHANGE for
PhysDisk 0 id=1
Aug  1 16:12:19 vzbeta kernel: mptbase: ioc0:   PhysDisk is now online, out
of sync
Aug  1 16:12:19 vzbeta kernel: mptbase: ioc0: Initiating recovery
Aug  1 16:12:19 vzbeta kernel: sd 0:1:0:0: mptscsih: ioc0: completing cmds:
fw_channel 0, fw_id 0, sc=ffff810234622500, mf = ffff81023e382b00, idx=6


Having a look in OMSA shows the array rebuilding, then after a few percent
complete, starts from 0% again.
In other words it looks like it is trying to rebuild, fails, then tries
again, over and over.


I've just had a chat to a nice Dell support peep and he's asked me to
upgrade the firmware on the Perc (I'm one release behind -- not sure how
that happened), and upgrade the driver, before we do anything else (e.g.
replace the drive).

The key step that's worrying me is that before I can upgrade the firmware
and driver I apparently need to basically kill off the RAID-ness on the
mirrored pair, so I end up with two individual hard disks.

Has anyone done this? What I really want to know is will doing this result
in total data loss?

My thinking is that since these are mirrored pairs I should end up with two
ordinary hard disks with identical data on them (even if one probably
doesn't work) when I delete the virtual volume. But I'd love to know if this
is really what's going to happen. Can anyone advise?

Once I've done that I'm happy with doing the firmware update, but not so
happy about doing the driver update. This is where we get back on topic.

A while ago I think someone posted something about some kind of problem with
the latest DKMS? I can't remember the details. Am I imagining things or is
there an issue? We use Centos 5.3 on these systems.

Any advice will be very much appreciated.


Thanks,

Faris.



_______________________________________________
Linux-PowerEdge mailing list
[email protected]
https://lists.us.dell.com/mailman/listinfo/linux-poweredge
Please read the FAQ at http://lists.us.dell.com/faq

Reply via email to