Hi all, Although this will start a bit OT, I'm going to get on-topic in a bit.
A quick look through the logs on one of our R200s has flagged what looks like a failing HD in our RAID-1 mirrored pair: Essentially I'm seeing several entries like this: Aug 1 16:12:13 vzbeta kernel: mptbase: ioc0: RAID STATUS CHANGE for PhysDisk 0 id=1 Aug 1 16:12:13 vzbeta kernel: mptbase: ioc0: PhysDisk is now missing Aug 1 16:12:13 vzbeta kernel: mptbase: ioc0: RAID STATUS CHANGE for PhysDisk 0 id=1 Aug 1 16:12:15 vzbeta kernel: mptbase: ioc0: PhysDisk is now missing, out of sync Aug 1 16:12:19 vzbeta kernel: mptbase: ioc0: RAID STATUS CHANGE for VolumeID 0 Aug 1 16:12:19 vzbeta kernel: mptbase: ioc0: volume is now degraded, enabled Aug 1 16:12:19 vzbeta kernel: mptbase: ioc0: RAID STATUS CHANGE for PhysDisk 0 id=1 Aug 1 16:12:19 vzbeta kernel: mptbase: ioc0: PhysDisk is now online, out of sync Aug 1 16:12:19 vzbeta kernel: mptbase: ioc0: Initiating recovery Aug 1 16:12:19 vzbeta kernel: sd 0:1:0:0: mptscsih: ioc0: completing cmds: fw_channel 0, fw_id 0, sc=ffff810234622500, mf = ffff81023e382b00, idx=6 Having a look in OMSA shows the array rebuilding, then after a few percent complete, starts from 0% again. In other words it looks like it is trying to rebuild, fails, then tries again, over and over. I've just had a chat to a nice Dell support peep and he's asked me to upgrade the firmware on the Perc (I'm one release behind -- not sure how that happened), and upgrade the driver, before we do anything else (e.g. replace the drive). The key step that's worrying me is that before I can upgrade the firmware and driver I apparently need to basically kill off the RAID-ness on the mirrored pair, so I end up with two individual hard disks. Has anyone done this? What I really want to know is will doing this result in total data loss? My thinking is that since these are mirrored pairs I should end up with two ordinary hard disks with identical data on them (even if one probably doesn't work) when I delete the virtual volume. But I'd love to know if this is really what's going to happen. Can anyone advise? Once I've done that I'm happy with doing the firmware update, but not so happy about doing the driver update. This is where we get back on topic. A while ago I think someone posted something about some kind of problem with the latest DKMS? I can't remember the details. Am I imagining things or is there an issue? We use Centos 5.3 on these systems. Any advice will be very much appreciated. Thanks, Faris. _______________________________________________ Linux-PowerEdge mailing list [email protected] https://lists.us.dell.com/mailman/listinfo/linux-poweredge Please read the FAQ at http://lists.us.dell.com/faq
