Re: RAID1 is DEGRADED, but which disk is faulty?
At 9:41 + 5/9/2003, Mark wrote: - Original Message - From: "jesse reynolds" <[EMAIL PROTECTED]> To: "Ruben de Groot" <[EMAIL PROTECTED]> Cc: <[EMAIL PROTECTED]> Sent: Friday, September 05, 2003 8:08 AM Subject: Re: RAID1 is DEGRADED, but which disk is faulty? Is it safe to shut it down and see what the Promise firmware is saying? Or will i then lose the ability to tell which disk is good and which has problems? I use an ASUS A7V333 board, with the onboard Promise PDC 20276 RAID controller (ATA133). Last summer I had the exact problem as you: ar0: ATA RAID1 subdisks: ad4 ad6 status: DEGRADED As the BIOS does a scan on the array at startup, going into the setup, it was immediately clear which drive was failing; and I could just rebuild from the BIOS (after putting in a new disk). There is also a "manual" test you could try. Simply open up your casing, and feel which drive is cold. :) Because after the array was degraded, ad6, in my case, was turned off. FreeBSD, btw, was able to boot off the degraded array as well. And, I mean, why not even? That is the whole purpose of RAID1. The warning messages kinda look scary, and make you feel your system is on the verge of collapse. But in reality, you just have a system running on one hard disk, like millions of other computers in the world. :) Cool, so you think it's safe to reboot the machine? The promise controller will know which disk is the 'up to date' one will it? (In the event that both drives are actually OK but the mirrir has 'broken' for some reason, perhaps they were never properly built in the first place - this is the first time I've run atacontrol to check the status!) Thankyou Jesse -- ::: Jesse Reynolds +61 (0)414 669 790 ::: AIM - jessedreynolds ::: ::: Virtual Artists Pty Ltd, Adelaide ::: http://www.va.com.au ::: ___ [EMAIL PROTECTED] mailing list http://lists.freebsd.org/mailman/listinfo/freebsd-questions To unsubscribe, send any mail to "[EMAIL PROTECTED]"
Re: RAID1 is DEGRADED, but which disk is faulty?
At 10:40 +0200 5/9/2003, Ruben de Groot wrote: On Fri, Sep 05, 2003 at 04:07:50PM +1000, jesse reynolds typed: At 18:26 +0200 3/9/2003, Ruben de Groot wrote: >On Wed, Sep 03, 2003 at 10:45:05PM +1000, jesse reynolds typed: >> Hi >> >> How do you tell which disk has problems in a hardware RAID1 array? My >> machine has been up for nearly six months, and only the last three >> weeks of messages logs are retained. There is nothing any in the last >> three weeks of system logs about any kind of hardware error, or >> warning, or anything mentioning ar0, ad4, or ad6. >> > > atacontrol is showing a DEGRADED status of the mirror: >> >> # atacontrol status ar0 >> ar0: ATA RAID1 subdisks: ad4 ad6 status: DEGRADED >> >> >> How does one tell which disk is faulty? Or is it possible that >> they've become out of sync and need rebuilding, in which case which >> one do I detach and reattach? >> >> I'm scared to reboot as it's a running server. What can I do > >You can try this: > >umount /dev/ar0 >fsck -f -n /dev/ad4 >fsck -f -n /dev/ad6 > >This way (hopefully) you'll be able to establish which one of the disks >in the array has developed problems. Then reboot and rebuild the array >from the other disk using the controllers firmware. Given that all the filesystems, including /, are on this mirror ar0, I would have to boot it up off a live CD or some such in order to do an fsck on both the submirrors. Ah yes, in this case that's probably the safest way. Hmmm. I wouldn't be able to do an fsck on /dev/ad4 or /dev/ad6 though because they are not filesystems, they're the full disks which in turn contain freebsd slices. Replace ad4, ad6 with the names of your partitions, eg ad4s1a etc. Right. ... do you think it is safe to reboot the server as it is? I'm wondering if dmesg might give more information about about which disk is still part of the mirror. Why can't atacontrol tell you which subdisks are good and which have been offline'd? Is it safe to shut it down and see what the Promise firmware is saying? Or will i then lose the ability to tell which disk is good and which has problems? Not very probable. But since your filesystems are still usable, you could make a backup, just to be on the safe side. Okay, thanks. Cheers Jesse -- ::: Jesse Reynolds +61 (0)414 669 790 ::: AIM - jessedreynolds ::: ::: Virtual Artists Pty Ltd, Adelaide ::: http://www.va.com.au ::: ___ [EMAIL PROTECTED] mailing list http://lists.freebsd.org/mailman/listinfo/freebsd-questions To unsubscribe, send any mail to "[EMAIL PROTECTED]"
Re: RAID1 is DEGRADED, but which disk is faulty?
- Original Message - From: "jesse reynolds" <[EMAIL PROTECTED]> To: "Ruben de Groot" <[EMAIL PROTECTED]> Cc: <[EMAIL PROTECTED]> Sent: Friday, September 05, 2003 8:08 AM Subject: Re: RAID1 is DEGRADED, but which disk is faulty? > Is it safe to shut it down and see what the Promise firmware > is saying? Or will i then lose the ability to tell which disk is > good and which has problems? I use an ASUS A7V333 board, with the onboard Promise PDC 20276 RAID controller (ATA133). Last summer I had the exact problem as you: ar0: ATA RAID1 subdisks: ad4 ad6 status: DEGRADED As the BIOS does a scan on the array at startup, going into the setup, it was immediately clear which drive was failing; and I could just rebuild from the BIOS (after putting in a new disk). There is also a "manual" test you could try. Simply open up your casing, and feel which drive is cold. :) Because after the array was degraded, ad6, in my case, was turned off. FreeBSD, btw, was able to boot off the degraded array as well. And, I mean, why not even? That is the whole purpose of RAID1. The warning messages kinda look scary, and make you feel your system is on the verge of collapse. But in reality, you just have a system running on one hard disk, like millions of other computers in the world. :) - Mark ___ [EMAIL PROTECTED] mailing list http://lists.freebsd.org/mailman/listinfo/freebsd-questions To unsubscribe, send any mail to "[EMAIL PROTECTED]"
Re: RAID1 is DEGRADED, but which disk is faulty?
On Fri, Sep 05, 2003 at 04:07:50PM +1000, jesse reynolds typed: > At 18:26 +0200 3/9/2003, Ruben de Groot wrote: > >On Wed, Sep 03, 2003 at 10:45:05PM +1000, jesse reynolds typed: > >> Hi > >> > >> How do you tell which disk has problems in a hardware RAID1 array? My > >> machine has been up for nearly six months, and only the last three > >> weeks of messages logs are retained. There is nothing any in the last > >> three weeks of system logs about any kind of hardware error, or > >> warning, or anything mentioning ar0, ad4, or ad6. > >> > > > atacontrol is showing a DEGRADED status of the mirror: > >> > >> # atacontrol status ar0 > >> ar0: ATA RAID1 subdisks: ad4 ad6 status: DEGRADED > >> > >> > >> How does one tell which disk is faulty? Or is it possible that > >> they've become out of sync and need rebuilding, in which case which > >> one do I detach and reattach? > >> > >> I'm scared to reboot as it's a running server. What can I do > > > >You can try this: > > > >umount /dev/ar0 > >fsck -f -n /dev/ad4 > >fsck -f -n /dev/ad6 > > > >This way (hopefully) you'll be able to establish which one of the disks > >in the array has developed problems. Then reboot and rebuild the array > >from the other disk using the controllers firmware. > > > Given that all the filesystems, including /, are on this mirror ar0, > I would have to boot it up off a live CD or some such in order to do > an fsck on both the submirrors. Ah yes, in this case that's probably the safest way. > Hmmm. I wouldn't be able to do an fsck on /dev/ad4 or /dev/ad6 though > because they are not filesystems, they're the full disks which in > turn contain freebsd slices. Replace ad4, ad6 with the names of your partitions, eg ad4s1a etc. > Is it safe to shut it down and see what the Promise firmware is > saying? Or will i then lose the ability to tell which disk is good > and which has problems? Not very probable. But since your filesystems are still usable, you could make a backup, just to be on the safe side. good luck, Ruben > Thankyou > > Jesse > > > > > > > >Ruben > > > >> Cheers > >> > >> Jesse > >> > >> PS: > >> > >> FreeBSD 4.6.2-RELEASE > >> > >> atapci1: port > >> 0xdc00-0xdc3f,0xe000-0xe003,0xe400-0xe407,0xe800-0xe803,0xec00-0xec07 > >> mem 0xdffe-0xdfff irq 11 at device 16.0 on pci0 > >> > >> ar0: 76293MB [9726/255/63] status: READY subdisks: > >> 0 READY ad4: 76319MB [155061/16/63] at ata2-master UDMA100 > >> 1 READY ad6: 76319MB [155061/16/63] at ata3-master UDMA100 > >> > >> > >> # atacontrol list > >> ATA channel 0: > >> Master: no device present > >> Slave: no device present > >> ATA channel 1: > >> Master: acd0 ATA/ATAPI rev 0 > >> Slave: no device present > >> ATA channel 2: > >> Master: ad4 ATA/ATAPI rev 5 > >> Slave: no device present > >> ATA channel 3: > >> Master: ad6 ATA/ATAPI rev 5 > >> Slave: no device present > >> > >> ___ > >> [EMAIL PROTECTED] mailing list > >> http://lists.freebsd.org/mailman/listinfo/freebsd-questions > >> To unsubscribe, send any mail to > >> "[EMAIL PROTECTED]" > > > -- > > ::: Jesse Reynolds +61 (0)414 669 790 ::: AIM - jessedreynolds ::: > ::: Virtual Artists Pty Ltd, Adelaide ::: http://www.va.com.au ::: > ___ > [EMAIL PROTECTED] mailing list > http://lists.freebsd.org/mailman/listinfo/freebsd-questions > To unsubscribe, send any mail to "[EMAIL PROTECTED]" ___ [EMAIL PROTECTED] mailing list http://lists.freebsd.org/mailman/listinfo/freebsd-questions To unsubscribe, send any mail to "[EMAIL PROTECTED]"
Re: RAID1 is DEGRADED, but which disk is faulty?
At 18:26 +0200 3/9/2003, Ruben de Groot wrote: On Wed, Sep 03, 2003 at 10:45:05PM +1000, jesse reynolds typed: Hi How do you tell which disk has problems in a hardware RAID1 array? My machine has been up for nearly six months, and only the last three weeks of messages logs are retained. There is nothing any in the last three weeks of system logs about any kind of hardware error, or warning, or anything mentioning ar0, ad4, or ad6. > atacontrol is showing a DEGRADED status of the mirror: # atacontrol status ar0 ar0: ATA RAID1 subdisks: ad4 ad6 status: DEGRADED How does one tell which disk is faulty? Or is it possible that they've become out of sync and need rebuilding, in which case which one do I detach and reattach? I'm scared to reboot as it's a running server. What can I do You can try this: umount /dev/ar0 fsck -f -n /dev/ad4 fsck -f -n /dev/ad6 This way (hopefully) you'll be able to establish which one of the disks in the array has developed problems. Then reboot and rebuild the array from the other disk using the controllers firmware. Given that all the filesystems, including /, are on this mirror ar0, I would have to boot it up off a live CD or some such in order to do an fsck on both the submirrors. Hmmm. I wouldn't be able to do an fsck on /dev/ad4 or /dev/ad6 though because they are not filesystems, they're the full disks which in turn contain freebsd slices. Is it safe to shut it down and see what the Promise firmware is saying? Or will i then lose the ability to tell which disk is good and which has problems? Thankyou Jesse Ruben Cheers Jesse PS: FreeBSD 4.6.2-RELEASE atapci1: port 0xdc00-0xdc3f,0xe000-0xe003,0xe400-0xe407,0xe800-0xe803,0xec00-0xec07 mem 0xdffe-0xdfff irq 11 at device 16.0 on pci0 ar0: 76293MB [9726/255/63] status: READY subdisks: 0 READY ad4: 76319MB [155061/16/63] at ata2-master UDMA100 1 READY ad6: 76319MB [155061/16/63] at ata3-master UDMA100 # atacontrol list ATA channel 0: Master: no device present Slave: no device present ATA channel 1: Master: acd0 ATA/ATAPI rev 0 Slave: no device present ATA channel 2: Master: ad4 ATA/ATAPI rev 5 Slave: no device present ATA channel 3: Master: ad6 ATA/ATAPI rev 5 Slave: no device present ___ [EMAIL PROTECTED] mailing list http://lists.freebsd.org/mailman/listinfo/freebsd-questions To unsubscribe, send any mail to "[EMAIL PROTECTED]" -- ::: Jesse Reynolds +61 (0)414 669 790 ::: AIM - jessedreynolds ::: ::: Virtual Artists Pty Ltd, Adelaide ::: http://www.va.com.au ::: ___ [EMAIL PROTECTED] mailing list http://lists.freebsd.org/mailman/listinfo/freebsd-questions To unsubscribe, send any mail to "[EMAIL PROTECTED]"
Re: RAID1 is DEGRADED, but which disk is faulty?
Is there a tutorial somewhere on how to do RAID1 in FreeBSD with 2 drives with the 2nd drive either same capacity or larger. I know the 2nd drive will only have the same capacity as the first but assuming you were using the FreeBSD Standard Boot Manager and then have /, /var, /usr - is there a way to do RAID1 so that the 2nd drive will have identical FreeBSD Standard Boot Manager in the MBR with the /, /var, /usr? Cheers, Vince - [EMAIL PROTECTED] - Vice President __ Unix Networking Operations - FreeBSD-Real Unix for Free / / / / | / |[__ ] WurldLink Corporation / / / / | / | __] ] San Francisco - Honolulu - Hong Kong / / / / / |/ / | __] ] HongKong Stars/Gravis UltraSound Mailing Lists Admin /_/_/_/_/|___/|_|[] [EMAIL PROTECTED] - oahu.DAL.NET Hawaii's DALnet IRC Network Server Admin ___ [EMAIL PROTECTED] mailing list http://lists.freebsd.org/mailman/listinfo/freebsd-questions To unsubscribe, send any mail to "[EMAIL PROTECTED]"
Re: RAID1 is DEGRADED, but which disk is faulty?
At 18:26 +0200 3/9/2003, Ruben de Groot wrote: On Wed, Sep 03, 2003 at 10:45:05PM +1000, jesse reynolds typed: Hi > How do you tell which disk has problems in a hardware RAID1 array? ... You can try this: umount /dev/ar0 fsck -f -n /dev/ad4 fsck -f -n /dev/ad6 This way (hopefully) you'll be able to establish which one of the disks in the array has developed problems. Then reboot and rebuild the array from the other disk using the controllers firmware. Thanks Ruben. Unfortunately there are only these two disks in the machine, with /, /usr, /var, and swap all slices of ar0. Thus, I cannot unmount ar0!!! Surely there must be a way of finding out which disk is being used and which is idle? Doesn't "degraded" mean that it's only using one of the two disks??? Thankyou Jesse Ruben Cheers Jesse PS: FreeBSD 4.6.2-RELEASE atapci1: port 0xdc00-0xdc3f,0xe000-0xe003,0xe400-0xe407,0xe800-0xe803,0xec00-0xec07 mem 0xdffe-0xdfff irq 11 at device 16.0 on pci0 ar0: 76293MB [9726/255/63] status: READY subdisks: 0 READY ad4: 76319MB [155061/16/63] at ata2-master UDMA100 1 READY ad6: 76319MB [155061/16/63] at ata3-master UDMA100 # atacontrol list ATA channel 0: Master: no device present Slave: no device present ATA channel 1: Master: acd0 ATA/ATAPI rev 0 Slave: no device present ATA channel 2: Master: ad4 ATA/ATAPI rev 5 Slave: no device present ATA channel 3: Master: ad6 ATA/ATAPI rev 5 Slave: no device present ___ [EMAIL PROTECTED] mailing list http://lists.freebsd.org/mailman/listinfo/freebsd-questions To unsubscribe, send any mail to "[EMAIL PROTECTED]" -- ::: Jesse Reynolds +61 (0)414 669 790 ::: AIM - jessedreynolds ::: ::: Virtual Artists Pty Ltd, Adelaide ::: http://www.va.com.au ::: ___ [EMAIL PROTECTED] mailing list http://lists.freebsd.org/mailman/listinfo/freebsd-questions To unsubscribe, send any mail to "[EMAIL PROTECTED]"
Re: RAID1 is DEGRADED, but which disk is faulty?
On Wed, Sep 03, 2003 at 10:45:05PM +1000, jesse reynolds typed: > Hi > > How do you tell which disk has problems in a hardware RAID1 array? My > machine has been up for nearly six months, and only the last three > weeks of messages logs are retained. There is nothing any in the last > three weeks of system logs about any kind of hardware error, or > warning, or anything mentioning ar0, ad4, or ad6. > > atacontrol is showing a DEGRADED status of the mirror: > > # atacontrol status ar0 > ar0: ATA RAID1 subdisks: ad4 ad6 status: DEGRADED > > > How does one tell which disk is faulty? Or is it possible that > they've become out of sync and need rebuilding, in which case which > one do I detach and reattach? > > I'm scared to reboot as it's a running server. What can I do You can try this: umount /dev/ar0 fsck -f -n /dev/ad4 fsck -f -n /dev/ad6 This way (hopefully) you'll be able to establish which one of the disks in the array has developed problems. Then reboot and rebuild the array from the other disk using the controllers firmware. Ruben > Cheers > > Jesse > > PS: > > FreeBSD 4.6.2-RELEASE > > atapci1: port > 0xdc00-0xdc3f,0xe000-0xe003,0xe400-0xe407,0xe800-0xe803,0xec00-0xec07 > mem 0xdffe-0xdfff irq 11 at device 16.0 on pci0 > > ar0: 76293MB [9726/255/63] status: READY subdisks: > 0 READY ad4: 76319MB [155061/16/63] at ata2-master UDMA100 > 1 READY ad6: 76319MB [155061/16/63] at ata3-master UDMA100 > > > # atacontrol list > ATA channel 0: > Master: no device present > Slave: no device present > ATA channel 1: > Master: acd0 ATA/ATAPI rev 0 > Slave: no device present > ATA channel 2: > Master: ad4 ATA/ATAPI rev 5 > Slave: no device present > ATA channel 3: > Master: ad6 ATA/ATAPI rev 5 > Slave: no device present > > ___ > [EMAIL PROTECTED] mailing list > http://lists.freebsd.org/mailman/listinfo/freebsd-questions > To unsubscribe, send any mail to "[EMAIL PROTECTED]" ___ [EMAIL PROTECTED] mailing list http://lists.freebsd.org/mailman/listinfo/freebsd-questions To unsubscribe, send any mail to "[EMAIL PROTECTED]"
Re: RAID1 is DEGRADED, but which disk is faulty?
Hmmm, doesn't look good: # atacontrol rebuild ar0 atacontrol: ioctl(ATARAIDREBUILD): Operation not supported by device any other ideas? Note that I used the Promise chip's firmware to initially build the mirror, so I trust that this is a "RAID capable ATA controller" (from the rebuild section of the atacontrol man page) Thanks Jesse At 8:12 -0500 3/9/2003, Peter Elsner wrote: Try rebuilding the array... atacontrol rebuild ar0 See if that helps... Peter Elsner At 10:45 PM 9/3/2003 +1000, you wrote: Hi How do you tell which disk has problems in a hardware RAID1 array? My machine has been up for nearly six months, and only the last three weeks of messages logs are retained. There is nothing any in the last three weeks of system logs about any kind of hardware error, or warning, or anything mentioning ar0, ad4, or ad6. atacontrol is showing a DEGRADED status of the mirror: # atacontrol status ar0 ar0: ATA RAID1 subdisks: ad4 ad6 status: DEGRADED How does one tell which disk is faulty? Or is it possible that they've become out of sync and need rebuilding, in which case which one do I detach and reattach? I'm scared to reboot as it's a running server. What can I do Cheers Jesse PS: FreeBSD 4.6.2-RELEASE atapci1: port 0xdc00-0xdc3f,0xe000-0xe003,0xe400-0xe407,0xe800-0xe803,0xec00-0xec07 mem 0xdffe-0xdfff irq 11 at device 16.0 on pci0 ar0: 76293MB [9726/255/63] status: READY subdisks: 0 READY ad4: 76319MB [155061/16/63] at ata2-master UDMA100 1 READY ad6: 76319MB [155061/16/63] at ata3-master UDMA100 # atacontrol list ATA channel 0: Master: no device present Slave: no device present ATA channel 1: Master: acd0 ATA/ATAPI rev 0 Slave: no device present ATA channel 2: Master: ad4 ATA/ATAPI rev 5 Slave: no device present ATA channel 3: Master: ad6 ATA/ATAPI rev 5 Slave: no device present ___ [EMAIL PROTECTED] mailing list http://lists.freebsd.org/mailman/listinfo/freebsd-questions To unsubscribe, send any mail to "[EMAIL PROTECTED]" Peter Elsner <[EMAIL PROTECTED]> Vice President Of Customer Service (And System Administrator) 1835 S. Carrier Parkway Grand Prairie, Texas 75051 (972) 263-2080 - Voice (972) 263-2082 - Fax (972) 489-4838 - Cell Phone -- ::: Jesse Reynolds +61 (0)414 669 790 ::: AIM - jessedreynolds ::: ::: Virtual Artists Pty Ltd, Adelaide ::: http://www.va.com.au ::: ___ [EMAIL PROTECTED] mailing list http://lists.freebsd.org/mailman/listinfo/freebsd-questions To unsubscribe, send any mail to "[EMAIL PROTECTED]"
Re: RAID1 is DEGRADED, but which disk is faulty?
Try rebuilding the array... atacontrol rebuild ar0 See if that helps... Peter Elsner At 10:45 PM 9/3/2003 +1000, you wrote: Hi How do you tell which disk has problems in a hardware RAID1 array? My machine has been up for nearly six months, and only the last three weeks of messages logs are retained. There is nothing any in the last three weeks of system logs about any kind of hardware error, or warning, or anything mentioning ar0, ad4, or ad6. atacontrol is showing a DEGRADED status of the mirror: # atacontrol status ar0 ar0: ATA RAID1 subdisks: ad4 ad6 status: DEGRADED How does one tell which disk is faulty? Or is it possible that they've become out of sync and need rebuilding, in which case which one do I detach and reattach? I'm scared to reboot as it's a running server. What can I do Cheers Jesse PS: FreeBSD 4.6.2-RELEASE atapci1: port 0xdc00-0xdc3f,0xe000-0xe003,0xe400-0xe407,0xe800-0xe803,0xec00-0xec07 mem 0xdffe-0xdfff irq 11 at device 16.0 on pci0 ar0: 76293MB [9726/255/63] status: READY subdisks: 0 READY ad4: 76319MB [155061/16/63] at ata2-master UDMA100 1 READY ad6: 76319MB [155061/16/63] at ata3-master UDMA100 # atacontrol list ATA channel 0: Master: no device present Slave: no device present ATA channel 1: Master: acd0 ATA/ATAPI rev 0 Slave: no device present ATA channel 2: Master: ad4 ATA/ATAPI rev 5 Slave: no device present ATA channel 3: Master: ad6 ATA/ATAPI rev 5 Slave: no device present ___ [EMAIL PROTECTED] mailing list http://lists.freebsd.org/mailman/listinfo/freebsd-questions To unsubscribe, send any mail to "[EMAIL PROTECTED]" Peter Elsner <[EMAIL PROTECTED]> Vice President Of Customer Service (And System Administrator) 1835 S. Carrier Parkway Grand Prairie, Texas 75051 (972) 263-2080 - Voice (972) 263-2082 - Fax (972) 489-4838 - Cell Phone ___ [EMAIL PROTECTED] mailing list http://lists.freebsd.org/mailman/listinfo/freebsd-questions To unsubscribe, send any mail to "[EMAIL PROTECTED]"